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Abstract 



The threshold for fault-tolerant quantum computation depends on the available re- 
sources, including knowledge about the error model. I investigate the utility of such 
knowledge by designing a fault-tolerant procedure tailored to a restricted stochastic 
Pauli channel and studying the corresponding threshold for quantum computation. 
Surprisingly, I find that tailoring yields, at best, modest gains in the threshold, while 
substantial losses occur for error models only marginally different from the assumed 
channel. This result is shown to derive from the fact that the ancillae used in thresh- 
old estimation are of exceedingly high quality and, thus, difficult to improve upon. 
Motivated by this discovery, I propose a tractable algebraic algorithm for predict- 
ing the outcome of threshold estimates, one which approximates ancillae as having 
independent and identically distributed errors on their constituent qubits. In the 
limit of an infinitely large code, the algorithm simplifies tremendously, yielding a 
rigorous threshold bound given the availability of ancillae with i.i.d. errors. I use 
this bound as a metric to judge the relative performance of various fault-tolerant 
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procedures in combination with different error models. Modest gains in the thresh- 
old are observed for certain restricted error models, and, for the assumed ancillae, 
Knill's fault-tolerant method is found to be superior to that of Steane. My algorithm 
generally yields high threshold bounds, reflecting the computational value of large, 
low-error ancillae. In an efi^ort to render these bounds achievable, I develop a novel 
procedure for directly constructing large ancillae. Numerically, the scaling and aver- 
age error properties of this procedure are found to be encouraging, and, though it is 
not fault-tolerant, I prove that each error can spread to only one additional location. 
Promising means of improving the ancillae arc proposed, and I discuss briefly the 
challenges associated with preparing the cat states necessary for my procedure. 
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Chapter 1 
Introduction 



The field of quantum computation seeks to harness the processing power imphcit in 
the structure of quantum mechanics. To do so, however, requires the abihty to create 
and precisely control quantum mechanical states on large numbers of subsystems. 
This is a difficult task for the same reasons that macroscopic quantum effects such 
as superpositions are exceedingly rare. In order to manipulate a quantum system, 
it must be made to interact with its environment, but these interactions inevitably 
expose it to corruption from environmental factors over which we have imperfect con- 
trol. Thus, the production of a complex quantum state spanning many subsystems 
is unlikely to proceed flawlessly. 

Flaws need not be fatal, however. Through quantum coding, quantum informa- 
tion can be made robust against many kinds of error. Quantum codes store data in a 
distributed fashion over multiple quantum systems so that damage caused by errors 
on a small number of the systems is fully reversible. Because unencoded data is at 
the mercy of the elements, methods have been developed for applying operations and 
correcting accumulated errors without ever decoding the stored information. 

Not all ways of manipulating encoded data are equal. Encoded operations that 
spread errors between different parts of an encoded state are likely to cause irrevo- 
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cable damage since quantum coding is based on the fact that errors affecting many 
subsystems are improbable. Consequently, an important topic in quantum com- 
puting is the construction of encoded operations that avoid propagating errors, a 
property known as fault tolerance. 

Fault-tolerant design minimizes the impact of errors, but it does not guarantee 
that a computation will succeed. It is possible for the probability of error to be so 
high that an encoded computation with error correction is more likely to fail than 
an unencoded computation. If the unencoded error probability is below a certain 
threshold, however, encoding and error correction provide a means to implement an 
arbitrary quantum algorithm using resources that scale efficiently in the size and 
desired accuracy of the computation. This error probability is known, aptly enough, 
as the threshold for quantum computation. 

Knowledge of thresholds is clearly a crucial design criterion for use in the engi- 
neering of quantum computing architectures, but no simple, unified scheme exists 
for determining them. Information regarding the threshold for quantum compu- 
tation is obtained from explicit constructions of fault-tolerant procedures, and, as 
such, is strongly dependent on the particular procedure utilized. Moreover, to permit 
the broadest possible applicability, threshold calculations are generally performed for 
generic or worst case error models using fault-tolerant procedures designed to match. 
By contrast, actual implementations of a quantum computer are likely to suffer from 
errors that possess more structure. The initial impetus behind the work contained 
in this dissertation was the desire to determine whether quantum computing might 
be rendered more feasible by taking advantage of that structure. 

It is to this end that I embark in Chapter 3 on a program of tailoring the fault- 
tolerant procedure of Steane to a specialized, though somewhat unrealistic, error 
model which would seem to hold major promise for improving the threshold. There 
I develop procedures for constructing ancillary states (henceforth ancillae) and im- 
plementing error correction that suit the error model adopted, and I investigate the 
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performance of my tailored procedure by analytically bounding the threshold as well 
as by estimating its value. I also estimate the threshold using Steane's method, and, 
comparing the two, I find that, contrary to expectations, the advantage of my ap- 
proach over that of Steane turns out to be quite small. Moreover, I show through 
further numerical estimates that my tailored procedure is not robust against small 
perturbations in the error model, thereby severely restricting its applicability. To 
clarify the origin of this uninspiring achievement, I estimate the threshold assuming 
that ancillae with no errors whatsoever are available as a resource. Ancillae are sin- 
gled out because I expect the biggest impact of my modified fault-tolerant procedure 
to be in reducing ancillary errors, but, in fact, I find minimal improvement in the 
threshold even when ancillae are perfect. 

Motivated by the relatively minor role that errors on ancillae seem to play, I 
devote Chapter 4 to developing a method for determining the threshold given ancillae 
with simplified error distributions. For such ancillae, I am able to derive quite high 
bounds on the threshold for fault-tolerant quantum computation in the limit that 
the size of the code becomes large. I say "bounds" rather than "bound" because the 
technique is sufficiently simple that I apply it to a selection of different error models 
and fault-tolerant procedures. As in Chapter 3, I see only a small improvement 
in the threshold for substantially restricted error models. Comparing fault-tolerant 
procedures, I find that the method of Knill always performs best for ancillae of 
the form considered. In addition to their interpretation as bounds for idealized 
resources, I discuss the merit of these threshold results as a means of approximating 
the outcome of threshold estimation. In that mode, agreement with prior work is 
found to be tolerable, and a modified version of the method better suited to small 
codes is explained and vetted. 

Having found in Chapter 4 that large ancillae with simple error properties are a 
sufficient resource for computing at high rates of error, I devote Chapter 5 to address- 
ing the question of where ancillae with such nice error properties might come from. 
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I do so by proposing a scheme for preparing ancillae in a broad class of quantum 
states known as graph states. This is done through a completely novel technique that 
tracks the locations of some errors during construction and infers the existence and 
locations of others. I advance three different variants on my routine for interpreting 
error information, and investigate the fault-tolerance properties of two of them ana- 
lytically. Neither is found to be fault-tolerant, but the error spread associated with 
each is small. I also perform numerical studies on the error distributions of states 
constructed via this method which show that the average number of surviving errors 
is less than the average number of failures that occurred. The resource requirements 
of this approach are found, with some caveats, to compare favorably with those of 
more traditional procedures, and possible elaborations to deal with correlated errors 
are discussed. 

This dissertation only includes research that I have done which is pertinent to 
the themes of thresholds, fault tolerance, and atypical error models. Chapter 4 
covers material published in Reference [ I >^], and Chapter 5 deals with a body of work 
that should eventually coalesce into a paper on ancilla construction. Topics that 
I have collaborated on with other researchers are not included, but some of these 
have resulted in papers that are available online. Two papers on non-local hidden 
variables are published in PRA [56, 9], and a paper on graphical representations of 
stabilizer states is currently available in preprint form [19]. 
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Chapter 2 



Background 



2.1 Quantum States 

The fundamental difference between classical and quantum physics is that quantum 
mechanics is incompatible with a local, realistic description of the world. Classically, 
the state of a system may be unknown, and widely separated systems may be corre- 
lated in complex ways, but there always exists a description in terms of incomplete 
information about local, objective states. By contrast, quantum mechanics can be 
shown both in theory [10, 35] and in practice [-38, 46] to encompass situations in 
which either locality or realism must be abandoned to be consistent with observed 
measurement results. Thus, while a classical system, such as a coin, must possess 
a single well-defined classical state, e.g. heads, the quantum mechanical analog of a 
coin can be in any superposition of allowed states, e.g. 

\coin) = —=\heads) H — -^Itails) (2-1) 
V 2 V 2 

by which we mean that the state of the coin is actually \heads) and \tails) in equal 
parts. Superposition is subtle, measuring whether a quantum coin is in the state 
heads or tails always yields one result or the other, but clever combinations of mea- 
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surements on multiple quantum coins can be used to show that superposition differs 
fundamentally from classical uncertainty. Coins with these bizarre properties exist 
in nature in the form of spin-| particles and in theory in the form of qubits. 

Qubits are idealized two-state quantum systems for which, in analogy with clas- 
sical bits, the standard basis states are labeled |0) and |1) rather than \heads) and 
\tails). The term basis is appropriate here because, mathematically, the state of a 
qubit exists in a two-dimensional Hilbert space H, that is, a two-dimensional complex 
vector space with a Hermitian inner product. Distinct classical states are orthogonal 
under this inner product, so (0| 1) = 0. For simplicity, we additionally assume that 
|0) and |1) are normalized, (0|0) = (1|1) = 1. Thus, the states |0) and |1) form a 
orthonormal basis for single-qubit states, meaning that an arbitrary pure^ state lip) 
of a single qubit can be written as 

|^) = a|0)+/?|l) (2.2) 

where a and P are complex numbers such that lap and are the probabilities 
of a measurement in the standard basis finding the states |0) and |1) respectively. 
These being the only allowed outcomes, conservation of probability requires that 

The states |0) and |1) do not constitute the only possible basis for Ti of course. 
The result of applying any invertible linear map to a basis is another basis. Thus, 
for example, 

l+) = ^(|0) + |l)) |-) = ^(|0)-|1)) (2.3) 

is an equally valid basis for single-qubit states. 

For the purpose of measuring in this and other bases, it is convenient to introduce 
the notion of projectors. A projector 11 projects onto a subspace of the Hilbert space. 



^The term pure basically excludes any classical uncertainty regarding the state, the 
state vector is known. More will be said about purity momentarily. 
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annihilating components of states outside of that subspace. Since projecting a second 
time onto the same subspace has no additional effect, projectors satisfy 11 ■ 11 = 11. 
Given a normalized state \4>) the projector onto \4>) is defined as n|^) — \(f)){4>\- The 
probability of finding the state |0) given the initial state is 

(V'|%)|V'> = (V'|0)(0|V') = l(V'|</'>r (2.4) 

where I have used the property of the Hermitian inner product that {(f)\ip) — (■01 0)*- 

The outcome of the measurement of an arbitrary Hermitian operator can be 
expressed in terms of projectors as well. For an observable O with normalized eigen- 
vectors l^i) and 1^2) corresponding to eigenvalues Oi and 02, we say that, for the 
initial state l'^), the outcome oi is obtained with probability (■^ |n|6)j)| ■0) and the 
outcome 02 is obtained with probability (7/; |n|6i2>| i')- As before, and for any pro- 
jective measurement, the outcomes can also be regarded as finding the state vector 
corresponding to the projector. Conveniently, the expected value of a measurement 
on such a Hermitian operator is simply 

E(C>||V;)) =0i(V;|H,<,^>|V^)+02(V^|H|,,)|V;> 

(2.5) 

= (V' |oiH|ei) + 02H|e2) ^ {ip \0\ ijj) 

It is only with the faculty to measure in other bases that the distinction between 

superposition and probabilistic combinations becomes clear. Given the initial state 
|-|-), the probabilities of measuring |-|-) and |— ) are 

p+ = (+ |H|+>| +) = ( + 1 +) ( + 1 +) = 1 and (2.6) 

p_ = (+|H|_)|+> = ( + |-)(-|+) = 0. (2.7) 



By contrast, consider an initial state that is, with equal probability, either |0) or 
|1). Such a state is said to be mixed to distinguish it from pure states where no 
uncertainty exists in our knowledge of the state. Given the initial state |0), the 
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probabilities of measuring |+) and |— ) are 

p+ = (0 |n|+)| 0) = ^(0|(|0) + |1))((0| + (1|)|0) = ^ and (2.8) 
p_ = (0 |n|_)| 0) = ^(0|(|0) - |1))((0| - (1|)|0) = l- (2.9) 



Likewise, p+ = p_ = | for the initial state |1). Thus, we find that, for a state initially 
prepared in either |0) or |1) with equal probability, measuring in the basis {|+), |— )} 
yields the result |+) 50% of the time and the result |— ) 50% of the time. This is 
very different from the case of the initial state |+), a superposition of |0) and |1), for 
which such a measurement always finds the state |+). 

The manipulation of mixed states such as the one above is greatly simplified by 
introduction of the density matrix. Density matrices represent probabilistic mixtures 

of states by convex combinations of the projectors corresponding to each state. The 
weight of each projector is determined by the probability of the associated state. 
Thus, the density matrix p for the equal mixture of |0) and |1) described in the 
preceding paragraph is 

P=^|0)(0| + ^|1)(1|. (2.10) 

In addition to being compact, this notation has the advantage of lumping together 
all mixtures with the same measurement statistics. The maximally mixed state, the 
situation in which nothing is known about the state of the system, for instance, can 
be expressed as an equal mixture of any set of basis states. An equal mixture of |+) 

and |— ) has the same density matrix 

= ^(|0) + |1))((0| + (1|) + Im - |1))((0| - (1|) (2.11) 
= ^|0)(0| + ^|1)(1| 
as an equal mixture of |0) and |1). 
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The projective measurements we have discussed so far can also be carried out 
using density operators. Given an initial state p, and a set of projectors Ua, the 
probability of getting result a is given by 

p, = tr(n,p), (2.12) 

which, due to the cyclic nature of the trace and the fact that = Ua and (Ha)^ = Ua, 
is the same as tiijlaplll). 

This second form of Equation (2.12) is of interest because it also applies to more 
general kinds of measurements. In fact, for any set of measurement operators {Ea} 
such that ElEa = I, the probability of obtaining the measurement result a is 

Pa = tT{EapEl). (2.13) 

where the restriction Y2a -^I-^a ~ ^ insures that probability is conserved, 
J^Pa = $^tr (EapEi) = 5^tr (ElEap) 

a a a 



(2.14) 

tr(/p) = tr(p) = 1. 



Like projective measurements, general measurements typically disturb the state of a 
system. Subsequent to obtaining the measurement result a, the state is 

EapEj 
tl{EapEl) 



Until till now, I have only discussed a single qubit, but the richness of quantum 
mechanics emerges from the properties of composite systems. 

A collection of qubits in pure states can be represented by a tensor product of the 
individual states. Writing out tensor products is frequently unwieldy, so a variety of 
shorthand notations are employed. The equation 

|0)®|0) = |0)^^ = |0)J0), = |0)|0) = |00), (2.16) 
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displays five different ways of representing a pair of qubits eacli in tlie state |0). 

A pure state on n qubits is a vector in a 2"-dimensional Hilbert space Ti", so 
superposition (addition) works tfie same as in tlie single-qubit case. In terms of tlie 
component subsystems, any pure state on multiple qubits can be represented by a 
sum over tensor products of pure states on tlie individual qubits, e.g.. 



represents a pair of qubits in a superposition of the joint states |0)|1) and |1)|0). 

The tensor product is an appropriate choice for combining quantum systems 
because it respects the linearity of superposition, e.g., for complex coefficients a, /?, 
7, and S, 



One comforting physical implication of this is that a system which is in the same 
state in all terms of a superposition is unchanged by the superposition. 

The formalism regarding measurement and density operators described earlier 
carries over directly to collections of qubits, though the additional concept of the 
partial trace must be introduced. The partial trace provides a way of ignoring sub- 
systems that we do not wish to consider by tracing over their degrees of freedom. 
Tracing over the first qubit of the singlet state, {ip) from Equation (2.17), for instance, 
yields the reduced density operator 



on qubit 2. Thus, ignoring qubit 1 of the state of qubit 2 appears completely 
random. An identical situation holds for qubit 1 when we ignore qubit 2. 

The singlet state is one of an oft-used basis for two-qubit states known as the 
Bell basis; the complete set is defined by 




(2.17) 



(«|0) + /3|1))(7|0) + 5\1)) = a^\00) + a5\01) + Pj\10) + P5\ll). 



(2.18) 



p, = trmm = i(0| ^) (^1 0), + ^) (^1 1), = ^|0)(0| + ^|1)(1| (2.19) 




(2.20) 
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where a,b E {0, 1} and the symbol + within the state vector represents bitwise 
XOR. As for the singlet state, measuring a Bell state in the standard basis yields 
measurement outcomes for the individual qubits that are completely random when 
considered alone but perfectly correlated between one another. It is through these 
sorts of subsystem correlations that it is possible to verify the existence of non- 
classical effects. For any orthogonal pair of basis vectors |0) = q;|0) + /3|1) and 
\e) = (3*\0) - a*\l), for instance, 

^(10)1^) -1^)10)) 

= ;7|((«io) - mwio) + - mo) + «ii))(«io) - pm (2.21) 

= ^ + Wf) (|0)|1) - |1)|0)) = ^(|0)|1) - |1)|0)) = |/3n), 

so measuring both qubits of the state IPn) in any orthogonal basis yields perfectly 
anti-correlated measurement results. Classically such correlations are impossible, a 
fact that was proven by John Bell [; ]. States that, like the Bell states, cannot be 
expressed as a product of pure states on the constituent systems are referred to as 
being entangled. Entanglement is by no means well understood, but somehow this 
property divides the realms of quantum and classical physics. 



2.2 Quantum Gates 



Measurements are not the only way that we can interact with a quantum system. 
States evolve over time according to their Hamiltonian. By modulating that Hamil- 
tonian, it is possible to produce transformations on a state. In deference to nature, 
physicists typically treat such evolutions as continuous in time. In quantum informa- 
tion, however, as in computer science, the focus is on discrete changes of state. Thus, 
rather than dealing with Hamiltonians and continuous time, quantum information 
deals with quantum operations corresponding to discrete time steps. 
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A general quantum operation looks very much like a general measurement. In 
fact, the definition of a general measurement given in Section 2.1 encompasses quan- 
tum operations if we allow for the possibility that the results of some measurements 
are inaccessible. In such a case, the density operator is given by a sum over the den- 
sity operators corresponding to possible output states weighted by their probability. 
A quantum operation from which no information is learned (and the system is not 
destroyed) is called trace-preserving. A general trace-preserving quantum operation 
S has the form 



where E'^^E'' = I. 

As in computer science, the allowed gate set is often restricted in quantum infor- 
mation theory since the ability to apply arbitrary gates would make almost anything 
possible, divorcing the subject of quantum computation completely from reality. In- 
stead, a basic set of plausibly implementable gates is chosen, generally, single-qubit 
measurement in the standard basis and a selection of unitary gates that act on one 
or two qubits at a time. This discrete, computation-oriented model of quantum 
mechanics is commonly known as the quantum circuit model. 

The gates X, Y, Z, H, P, ^X, '"Z, X, and T (defined below) constitute the gate 
set Ug used in this dissertation. There is a great deal of redundancy in this set. 
Everything but the T gate can be constructed in a straightforward manner using 
the subset Cg = {H, P,'"X}. Moreover, the subset Ug = {H,'"X,T} permits not 
only a straightforward construction of the gates in Ug but, in a significantly less 
straightforward fashion that is described in Subsection 2.3.4, the construction of any 
unitary transformation on qubits. Nevertheless, it is frequently convenient to refer 
to the superfiuous gates in Ug, hence their definition below. 

X, Y, and Z are used to denote the Pauli spin operators. They are given in the 




(2.22) 



a 
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standard ({|0), |1)}) basis by 





1 




-i 


X = 










1 




i 



and 



Z ^ 



1 
-1 



(2.23) 



In addition to their role as gates, X, Y, and Z also serve the function of measurement 
operators with eigenvalues ±1 and eigenstatcs |0) ± |1), |0) ±i|l), and |0)/|1), respec- 
tively. Frequently, the bases corresponding to their eigenvectors are even referenced 
by the Pauli operator, e.g., the Z basis is the standard basis. 

Concordant with convention, 1 use H to denote the Hadamard gate and T to 
denote the | rotation (about the Z axis), also known as the | gate. The phase gate, 
which is the | rotation about the Z axis, I denote by P {S is frequently used in the 
literature as well) . These gates are given in the standard basis by 



H 



1 

71 



1 1 




1 




, p = 




1 -1 




i 



and T 



1 

e''^/^ 



(2.24) 



'"X refers to the controUed-NOT gate, a two-qubit gate that applies X to the 
target qubit conditional on the state of the control qubit being |1). The controUed- 
NOT gate is also sometimes called the controUed-X or XOR gate. The controUed-Z 
gate, ^Z, is a, similar two-qubit gate which applies Z conditional on the value of the 
control. These gates are written in the standard basis as 



'X 



10 

10 
1 
10 



and 



10 

10 
10 
-1 



(2.25) 



I use the remaining symbol in Ug, X, to denote the SWAP gate. The SWAP gate 
exchanges the state of two qubits, effectively relabeling them. In the standard basis 
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it is 



X 



10 
10 
10 
1 



(2.26) 



In addition to limiting the allowed gates, it is necessary to limit the permissible 
input and output states for a quantum computation. As described in the previous 
section, complex measurements command at least as much power as complex gates, 
so qubits arc required to be measured in the standard basis. Similarly, qubits are 
required to be initialized in the standard basis partly to avoid complex input states 
such as "the solution to my problem" . These restrictions also prevent us from over- 
looking a distinctly quantum mechanical problem, the physical difficulty involved 
in preparing initial states and performing measurements. Thus, the use of complex 
initial states or measurements should always be justified. 



2.3 Classes of Quantum Gates and States 

This section discusses classes of states and gates important to the field of quantum 
computation, including the gate groups generated by the gate sets Vg, Cg, and Ug. 



2.3.1 The Pauli Group 

The Pauli group is the subgroup of the group of unitaries U generated by the Pauli 
gates Vg — {X, y, Z}. Exphcitly, the Pauh group on n-qubits is 

P'' = {±l,±i} X (2.27) 

where the phases arise from products of Pauh operators such as ZX — iY. As 
for the single-qubit case, elements of this group are referred to as Pauli operators. 



Chapter 2. Background 



15 



I occasionally also refer to X-type or Z-type Pauli operators, by which I mean 
multi-qubit Pauli operators consisting only of / and the specified single-qubit Pauli 
operator. 

The un-phased singlc-qubit Pauli operators are Hermitian, so, in addition to being 
unitaries, many Pauli operators are also observables. In fact, we can divide the Pauli 
group into two parts V"" — U iP" where 

= ±{/, X, r, z}®" (2.28) 

and all elements of 'P" are observables that square to the identity. The set 'P" is 
not a group since products of elements can yield members of iP", e.g. ZX — iY. 
Like P", however, P" does have the property that it is closed under conjugation by 
elements oi Cg, a fact that will be of interest in the next section. 

Another useful partition of the Pauli group is P" = P" U iP" where 

P" = ±{/, X, iY, Z}®". (2.29) 

The Pauli operators in are all real, as, therefore, are their products, so the set 
V"^ is closed under multiplication and, consequently, a group. Unlike V"", however, 
P"' is not closed under conjugation by elements of Cg, e.g. PXP^ = Y. 

Finally, any of the partitions of the Pauli group also forms a basis for operators. 
This is most easily seen by showing that any 2" x 2" dimensional matrix L^'^\i,j) 
with L^''\i,jy'' = S; igSjh can be decomposed into a linear combination of n-qubit 
Pauli operators. 

For a 2 X 2 matrix 

L(i) (0, 0) = (7 + Z) /2 L(i) (0, 1) = (X + iY) /2 

L«(1,0) = (X-iF)/2 L(i)(1,1) = (/-Z)/2; (2.30) 

taking tensor products of the single-qubit L'^^\i,j) it is easy to obtain any L'^'"\i,j). 
The L^'"\i,j) form a basis for n-qubit operators with an arbitrary n-qubit operator 
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O being written as 

n— 1 n— 1 

= ^^O^^L(")(i,i) (2.31) 

which demonstrates that any O e x can be decomposed into elements of 
since each L'^"\i,j) can be decomposed into elements of V"^. 

2.3.2 Stabilizer States 

Previously I specified quantum states by sums of basis vectors with complex coef- 
ficients, but a state can also be specified as the eigenvector corresponding to some 
particular set of eigenvalues of a complete set of commuting observables. A especially 
convenient class of observables is the group of n-qubit Pauli operators. 

Elements of the Pauli group are a desirable choice for constructing complete sets 
of commuting observables for a number of reasons. Foremost is the fact that multi- 
qubit Pauli operators are simply tensor products of single-qubit Pauli operators and 
thus possess a description efficient in the number of qubits. Additionally, since the 
eigenvalues of each of the single-qubit Pauli operators are ±1, the eigenvalues of any 
n-qubit Pauli operator are also ±1. And, in a similar vein, any two n-qubit Pauli 
operators either commute or anti-commute since the elements of the single-qubit 
Pauli group all either commute or anti-commute. 

Using the Pauli group, we can define a ubiquitous and extremely useful class 
of quantum states known as the stabihzer states. The class of stabihzer states is 
defined as the set of states that can be specified as the simultaneous +1 eigenstate 
of some set Sg = {G^} of n independent, commuting Pauli group elements. The set 
Sg generates a subgroup S of the Pauli group known as the stabilizer of the state, 
and the individual elements, , are referred to as stabilizer generators. Stabilizer 
generator sets are not unique; replacing any generator with the product of itself and 
another generator yields an equivalent generating set. Thus, replacing G^ by G^G'^ 
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for k ^ j has no effect on the stabihzed state. An arbitrary product of stabihzer 
generators, that is, an arbitrary element of S, is called a stabilizer element. 

A useful (unnormalized) representation of the stabilized state is 

2-- 2- n + DM) (2-32) 

for any whose overlap with the stabilized state is non-zero. This state satisfies 
the eigenvalue conditions since 

2-" J2 = 2-" Yl ^1^) (2.33) 

AGS Aes ces 

for any B e S. 



Binciry Generator Matrix 

An alternative description of Pauli operators, and therefore of stabilizers, is provided 
by the binary or symplectic representation of the Pauli group. In the binary repre- 
sentation an arbitrary n-qubit Pauli operator A is expressed in terms of a pair of 
length n binary strings x{A) and z{A) such that 



xj{A) = 



z,{A) 



HA^ 


= X or A^ = 


= Y 


ifA^ 


= 7 or A^ = 


Z 


HA^ 


= F or A^ = 


-- Z 


ifA^ 


= 7 or A^ = 


X 



and 



(2.34) 



(2.35) 



where A^ is the jth Pauli operator in the tensor product for A and Xj{A) and Zj{A) 
are the jth bits of x and z. The resultant binary strings are typically placed side by 
side in a matrix or list with a vertical line between them; thus. 



hX2YsZ^ = [xihX^Y^Z^) \z{hX2Y^Zi)] 



110 



11 



(2.36) 
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1 





1 





1 
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1 









XXY, IZZ 

XYX, ZIZ 

YXX, ZZI 

-YYY, III 



Figure 2.1: The a) stabilizer generator, b) binary stabilizer generator matrix, and c) 
complete set of stabilizers for the three-qubit GHZ state. 



The natural inner product for such vectors is the symplectic inner product, which 
satisfies 

[x{A)\z{A)] ■ [x{B)\z{B)] = x{A) ■ z{B) + z{A) ■ x{B). (2.37) 

where the addition is performed modulo 2. It is a simple exercise to show that two 
Pauli operators commute if and only if their symplectic inner product is 0. 

Binary notation is particularly useful for manipulating stabilizer generators, which 
are arranged for the purpose in a split matrix where each row is the binary repre- 
sentation of a single generator. Figure 2.1 shows a stabilizer generator for the three- 
qubit GHZ state and the corresponding binary stabilizer generator matrix and set 
of stabilizers. 



2.3.3 The Clifford Group 

Of the gates introduced in Section 2.2, all but the T gate have the property that 
they normalize the Pauli group, which is to say that 

JJ-pnjj] ^ pn ^2.38) 

for U E C'g = {X, F, Z, if, P, '"Z, X}. This property is easily verified for any 
U E Cg = {H, P, '"X} by explicitly determining the result of conjugating by each H 
and P for every Pauli operator and by '"X for every pair of Pauli operators. The 
result can then be extended to the other gates in C'g or, indeed, an arbitrary sequence 
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of gates in Ci 



'g, by composition of Equation (2.38), e.g., for U,V E Cg, 



(2.39) 



Thus, the set Cg generates a subgroup^ of the group of unitaries U wherein each 
element takes Pauh operators to Pauh operators under conjugation. 

Compare this group with the Chfford group C G U, which is defined to be the 
normahzer of the Pauh group, that is, the set of all gates U such that Equation (2.38) 
holds. It is tempting to simply assert that the group generated by Cg is C; using the 
following lemma it is straightforward to prove that the gate set Cg suffices, up to an 
overall phase of i, to transform any non-identity Pauli operator into any other. 

Lemma 1. For any Pauli operator A s.t. ^ I, we can construct a unitary, U , s.t. 
U AU'^ oc Xj where U is composed exclusively of gates in Cg acting on non-identity 
elements of A. 

Conjugating by Hadamard and phase gates as necessary, transform A to A' such that 
A' consists only of X and / operators. Subsequently conjugating A' by ^Xjj. for all 
k ^ j such that A'^ = X yields a Pauli operator proportional to Xj. 

Lemma 1 shows that the group of unitaries generated by Cg includes, for any 
choice of A,B G P", unitaries U and V such that UAW oc Xj and VBV^ oc X^. 
Consequently, since Cg generates both X and Cg, it also includes W = V'^yijkU, for 
which WAW'^ oc B. The constant of proportionality can only be ±1 or ±i since 
Cg preserves P", and it can be reduced to either 1 or i by allowing the insertion of 
a conjugation by Zj after the conjugation by U . While there exists a sequence of 
gates converting any single Pauli operator into a multiple of any other, however, the 
process does not transform each Pauli operator independently, so, phases aside, it 
need not encompass every possible function on V. In fact, as shown below, unitarity 
forbids this. 



^Recall that H'^ = = H, ^Z'^ = ^Z^ = ^'Z, and p-i = pt = p3_ 
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Conjugation by any unitary U is an isomorphism, since the operation is both 
bijective, 

UBU^ = UAU^ iff B = A, (2.40) 

and a homomorphism, 

UBAU^ = UBU^UAUl (2.41) 

Among other things, this imphes that commutators are preserved, 
\UBU\ UAU^] = UBU^UAU^ - UAU^UBU^ 

(2.42) 

= U{BA - AB)U^ = U[B, A]U\ 

as are eigenvalues, 

tT{UBU^UUU^) = tiiUBUU^) = tr(5n). (2.43) 

Moreover, unitary conjugation is linear, 

U{aA + (3B)U^ = aUAU^ + (3UBUl (2.44) 

Clearly this rules out a variety of functions. It is, for instance, impossible to take 
Xi to Xi while at the same time taking Zi to X2. Nor can conjugation by a unitary 
take Xi to iXi or any other Pauli operator with imaginary eigenvalues, showing 
that the formerly described transformations on individual Pauli operators are the 
most general possible using Cg. These restrictions apply equally to C and the group 
generated by Cg, so they simplify rather than settle the question of equality. 

In what follows, I show that Cg generates the Clifford group C by giving an 
explicit routine for constructing a sequence of gates that implements an arbitrary 
linear isomorphism on P". The equivalence of C and the group generated by Cg 
was first shown by Gottesman [2-3, 24]. I present a different approach'^ developed 



■^It turns out that Aaronson and Gottesman have also proven the equivalence of C and 
the group generated by Cg in the manner shown here, [i] 
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by Carlton Caves and myself with the assistance of Andrew Silberfarb and Steven 
Flammia. 

An isomorphism is fully described by its effect on a complete basis since, taken 
together, Equations (2.40) and (2.41) imply that applying an isomorphism to a com- 
plete basis yields another complete basis. Thus, by enumerating the elements of 
the two bases, it is possible to divine the full isomorphism. Equation (2.42) shows 
that commutators are preserved, indicating that it might be wise, for the purpose 
of enumerating isomorphisms, to choose a form of basis with simple commutation 
properties. 

The Pauli group on n-qubits can be expressed in terms of an overall phase (which 
transforms trivially due to linearity) represented by the Pauli operator iJ and a 
basis of 2n elements of Vn- With regard to commutation properties, a particularly 
simple choice of basis would be one in which all basis elements commute. It is, 
however, impossible to choose more than n such independent, commuting n-qubit 
Pauli operators, as can be shown using the following lemma. 

Lemma 2. Given a group V s.t. FC = ±CF for every C,F G V, any element 
D eT) either commutes with every C E V or it anti- commutes with half of them. 

Write VasV = ^U-B where {D, C} = for all C G ^ and [D, C] = for all C e B. 
If A is non-empty (i.e. D anti-commutes with something) then, for any A E A, 
V = AA U AB is a partition of V such that the elements of AA commute with D 
and the elements AB anti-commute with D. Thus, A A = B and AB = A, implying 
that A and B have the same size. 

Following Preskill [ ], we can imagine picking independent, commuting Pauli 
operators from Vn sequentially. Given a set of — 1 n-qubit Pauli operators A = 
{^'' }j=i) the set of Pauli operators that commute with all of them forms a group A'^. 
Lemma 2 assures us that any element chosen from A'^ either commutes with every 
element of ^4"*" or anti-commutes with half of them. An element that commutes 
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with every element of A''~ is either proportional to / (a class of elements already 
chosen) or it is not independent of A. Thus, the number of available commuting 
observables decreases by half each time an independent Pauli operator is added to 
A. Having chosen k — 1 independent, commuting Pauli operators from Vn, the 
number of commuting Pauli operators remaining is 4""'"^/2^"^. Of these, 4 x 2^^^ 
are dependent, corresponding to all distinct choices of the four phases and the k — 1 
previously chosen elements of A. The number of available independent, commuting 
operators for the kth element of A is thus 4"+i/2'^'^^ — 4 x 2^'^^ which equals zero 
when k = n + 1. 

In lieu of a basis of commuting Pauli operators, consider a basis of 2n Pauli 
operators divided into two sets A = {A^}^^^ and B = {B^}'j^-^ such that [A-', A'^] = 0, 
[B^^B''] = 0, and [A\B''] = unless j = in which case {A\B''} = 0. Each of 
these sets stabilizes a state, and each element commutes with every other element 
except its mate in the other set. Figure 2.2 provides a concrete example of this paired- 
stabilizer form, which Carlton Caves rediscovered while counting Clifford operations. 
An earlier use of the formalism appears in Reference [1]. 

To verify that such a pair of stabilizers exists, imagine, after choosing the stabi- 
lizer A by the process outlined in the previous paragraph, that we proceed to pick 
the elements of B. Each B'' must commute with all A^ s.t. j ^ k and all previ- 
ously chosen B. A'' satisfies the same property, so both A'' and B'^ are among the 
^n+i^2"^i+'=^i _ 2™-fc+4 elements of the group that commutes with the other basis 
elements. From Lemma 2 we know that only half of the elements of this group anti- 
commute with A^, however, so there are 2"'~^~^^ choices for i?^, which equals 8 when 
k = n. 

This procedure also provides a way to count the number of linear isomorphisms. 
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(a) 



(b) 



X I I I 

I X I I 

I I X I 

I I I X 

I I I I 



I 
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X 



Z I I I I 

I Z I I I 

I I Z I I 

I I I Z I 

I I I I z 



X Z Z X I 

I X Z Z X 

X I X z z 

Z X I X z 

X X X X X 



I I I Y Y 

Y I I Y I 

I Y I Y I 

I I Y Y I 

Z Z Z Z Z 



Figure 2.2: The a) canonical stabilizer pair for 5 qubits and b) a pair of non-canonical 
5-qubit stabilizers. The left stabilizer in a) stabilizes |+)®^. The left stabilizer in b) 
stabilizes |+) of the 5-qubit code. 



The number of ways to pick A is 

fc=l ^ fc=l k=l 

Given A, the number of ways to pick B is 

n 

2^-^+3 _ 2"^+3'*-Efc=i k _ 2(n2+5n)/2 ^g-j 

k=l 

Thus, the total number of linear isomorphisms is 

n 

gSnJ-J^n ^2.47) 

k=l 

This is not actually the number of elements in the Clifford group, since, as discussed 
earlier, conjugation by a unitary cannot generate a phase of i. Half of the possible 
transformations on each basis element are unobtainable, so the number of elements 
in the Chfford group is at most (and I show below exactly) 



25nJ-j4 



n 

n 4k— I 



k=l 



k=l 



The preceding discussion shows that there exist bases for V"' consisting of il and 
a set of paired stabilizers. Isomorphisms, due to their preservation of commutation 
relations, take a basis in paired stabilizer form to another basis in paired stabilizer 
form. Since factors of i are not produced by the transformation in question, it 
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makes sense to restrict to paired stabilizers composed of elements from P", a group 
that is without additional phases. Thus, to show that the gate set Cg generates 
every isomorphism on it is sufficient to show that gates from Cg can be used to 
transform any paired stabilizer basis for into any other paired stabilizer basis for 
P". The primary result we need for this is the following lemma. 

Lemma 3. For any A,B E s.t. {A,B} = and B^ ^ I, we can construct a 
unitary, U , s.t. U AU'^ = Xj and UBU'^ = Zj where U is composed only of gates in 
Cg acting on locations where A and/or B have non-identity elements. 

Having started in P", Lemma 1 and the subsequent discussion indicate that we can 
construct a unitary V composed of gates in Cg such that B' = VBV'^ = Xj. From 
Equation (2.42), A' = V AV'^ anti-commutes with B' implying that = Y or Z. 
Conjugating by Hadamard and phase gates as necessary, transform B' to B" = Zj 
and A' to A" such that A" consists only of X and / operators. Next conjugate by 
""Xjk for all k ^ j such that A''' = X, converting A" to ±Xj. If —Xj is obtained the 
sign can be removed by conjugation by Zj. B" is unchanged since none of the gates 
transform Zj. As promised, no gates are applied to locations k where A^ = B'' = I. 

And finally, we see that all possible isomorphisms on paired stabilizers whose 
elements are taken from are generated by Cg. 

Theorem 1. For any matched pair of stabilizer generators A = {A^} and B = {B^} 
where A\ B'J G F", 3 a unitary, U, s.t. UAU^ = Xj and UB^W = Zj Vj where U 
is composed only of gates in Cg. 

Lemma 3 guarantees that there exists a unitary V composed of gates in Cg such that 
VA^V'< = Xk and VB^V^ = Z^. Conjugation by Xjk would then resuh in A'^ = Xj 
and B'-' = Zj. A'^-' = B'^-' = I for h ^ j since each of these Pauli operators must 
commute with Xj and Zj. This process can be repeated for all values of j without 
disturbing the previously transformed operators since their non-identity locations are 
identities for other Pauli operators. 
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This theorem shows that the Chfford group and the group generated by Cg im- 
plement the same set of transformations on P". Since forms a complete basis for 
operators on Ti" the two groups are, in truth, equal. 

2.3.4 Universality 

By all rights this section should contain a proof of the universality of the gate set 
Ug = {H, '"X, T}. However, due to time constraints and the fact that I lack anything 
new to add to the proof, I omit it. The basic idea is that an arbitrary n-qubit unitary 
can be decomposed into two-level unitaries [''], which can then be decomposed 
into gates and single-qubit unitaries [ ]. Arbitrary single-qubit unitaries can be 
approximated to any precision by judicious sequences of the gates H and T [11]. 
This last fact is shown by identifying a pair of such gate sequences corresponding 
to an irrational angle of rotation 6 about two orthogonal axes rii and n2 in the 3-D 
rotational representation of single-qubit unitaries. Euler's decomposition allows the 
desired rotation to be decomposed into rotations about rii and n2 each of which 
can be approximated to any accuracy by a multiple of 6. Readers desiring a full 
treatment of this universality proof are referred either to the references above, or to 
the rendition in the textbook by Nielsen and Chuang [37]. 

2.4 Quantum Circuit Diagrams 

Quantum circuit diagrams provide a pictorial method for representing the application 
of discrete operations, or gates, to a quantum system. Diagrams consist of horizontal 
lines, representing qubits, interrupted by squares and other decorations, representing 
discrete unitaries applied to the interrupted qubits, and optionally punctuated by 
any of a variety of symbols representing measurement. The order of operations is 
from left to right, with the initial state written to the left of the qubits (lines) it 
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Operation 
Measurement 



U 



12 



12 



21 



X 



Circuit element 



or 



U 



or 



X 



or 



or 



Table 2.1: A basic set of one- and two-qubit quantum operations and their cor- 
responding circuit elements. Here U is used to represent an arbitrary single-qubit 
unitary. Gates with multiple controls, such as the TofoUi, or a control on |0), indi- 
cated by an open dot, are also allowed, and, in general, complex multi-qubit unitaries 
are indicated by a large labeled box spanning the qubits acted on by the unitary. 



applies to. Classical data is denoted by double lines where double lines emanating 
from a measurement are assumed to carry the measurement value and double lines 
intersecting a unitary denote a classical control. As with electrical circuits, repeated 
usage of a small number of standard, simple parts results in quantum circuits that 
are easier to understand and implement. Table 2.1 depicts the standard one- and 
two-qubit quantum gates as rendered in this dissertation. An example quantum 
circuit, the teleportation circuit, is given in Figure 2.3. 

In addition to the normal advantages of a graphical depiction for visualization, 
quantum circuit diagrams provide a powerful tool for proving identities. Circuit 
diagrams clearly and concisely indicate the order of operations as well as which 
qubits each operation acts upon. This property, when augmented by a selection 
of simple circuit identities, permits the transformation of many circuits on a grand 
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H 



|0) 
|0> 



^ 



H 



X 



Figure 2.3: An example quantum circuit diagram showing the teleportation cir- 
cuit. In this circuit Hi"Xi2"X^2Hz is apphed to the initial state |-?/')|0)|0) yielding 
(|00)|^) + |01)X3|^) + 110)^31^/;) + |ll)X3Z3|^/;))/2. The first two qubits are then 
measured, and, conditional on the measurement results, corrective gates are applied 
to complete the teleportation, thereby yielding on the output. 



X — Z 



I — Z — X 



X 



X 



X 



uv 



H — X — H 



H — Y — H 



X 



ly^Y 
V 



X 



X 



^x 



^x 



z 



V 



V — u 



z 



^ z 



V 



u 



Figure 2.4: Frequently used circuit identities. U and V here represent arbitrary 
two-qubit unitaries. Gates applying a simple phase, such as i or —1, are included in 
the identities for use in breaking up controlled operations like alone, they merely 
impart an overall phase and can thus be omitted. 
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scale, without ever resorting to either matrices or state vectors. Figure 2.4 hsts the 
bulk of the simple circuit identities employed during the subsequent chapters, though 
it omits those that are powers of a gate such as ^Zj^^Zj^ = I and (P)^ = Z . 

2.5 Codes 

At its core, coding is the art of identifying sets of physical states that, under the 
influence of errors, are unlikely to transition between each other. A code is nothing 
more than a mapping between a set of such states, or codewords, and a set of logical 
(information) states that we wish to protect. No set of states is robust against every 
kind of assault, however, so some knowledge about the nature of the errors must be 
assumed. 

Much of coding theory is built around the very reasonable (and not uncommonly 
true) assumption that independent elements suffer errors independently. In this case, 
the probability that a pair of errors afflicts two elements is equal to the product of 
the probabilities of the errors afflicting the individual elements, or, more generally. 




(2.49) 



where Ej denotes an error on element j. This assumption ensures that the most 
probable errors are of those that affect the fewest elements. Put another way, the 
most probable error operators are those with the lowest Hamming weight, where the 
Hamming weight is defined as the number of non-trivial components in a string. 

Coding theory, both quantum and classical, are vast subjects, and I will not even 
begin to cover them in their entirety. Instead, this section touches upon the basic 
properties of binary, linear codes relevant to the task of protecting against all errors 
below a certain weight. 
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2.5.1 Classical Codes 

Protection against low-weight errors is achieved through the use of codes whose 
logical (encoded) operations have high weight, that is, codes that distribute logical 
information across many elements. Perhaps the most straightforward method of 
constructing a code with this property classically is to employ simple redundancy, as 
in the class of repetition codes where logical states are encoded as many copies 
of themselves. 

The smallest of the repetition codes is the two-bit repetition code ^2 for which 

= 00 1 = 11 (2.50) 

where I have placed a bar over logical states to identify them. The two-bit repetition 
code is capable of detecting a single bit error since flipping either the first or the 
second bit of a codeword yields something that is not a codeword. It is not, however, 
able to detect two errors since flipping both bits is the logical (encoded) bit-flip 
operation. Nor is the two-bit repetition code capable of correcting one-bit errors 
since flipping a single bit of either codeword yields a state that might have been 
produced by flipping a bit of the other codeword. 

In order to correct errors (while storing information) it is necessary to have three 
bits. The three-bit repetition code ^3 is defined by the codewords 

= 000 1 = 111. (2.51) 

In addition to detecting any two bit errors, this code can correct any single error 
since, for a single bit error, the initial value of the logical state can be recovered 
via majority polling. Figure 2.5 shows the state diagrams for the two and three-bit 
codes. 

Implicitly, the preceding example has associated error states with the nearest 
codeword, where the distance between two states is defined as the minimum weight 
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(a) (b) 




Figure 2.5: State diagrams for a) two and b) three bits. Connected states differ by a 
single bit flip. Consequently, the length of the shortest path between two states is also 
the smallest number of bit flips that can convert one into the other. When decoding 
the three-bit repetition code, the four states on the left side of b) are mapped to 
while the four states on the right are mapped to 1. 

of any operator that transforms one into the other. This manner of interpreting 
error information, known as minimum-distance decoding, is appropriate whenever 
low- weight errors are more probable than those of high weight. 

The number of errors that can be detected or corrected with certainty using 
minimum-distance decoding is determined by the minimum distance of the code. 
The minimum distance of a code is defined as the minimum weight of any of its 
logical operators. For linear codes, that is, those for which 

a + b — c implies a + b — c, (2.52) 

the minimum distance is equal to the weight of the lowest-weight codeword since 
that codeword is generated by applying the lowest-weight logical operator to the 
zero vector. A code with distance d can detect d~l errors since all non-trivial errors 
affecting ci — 1 or fewer bits result in a state that is not a codeword, but at least one 
error of weight o? is a logical operation and so cannot be detected. The same code 
can correct t — [^J errors since all errors of weight t or less yield distinct states, 
but there exists a pair of errors of weights [|J and [|] such that the two errors take 
two different codewords to the same state and are thus indistinguishable. 
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Finally, it should be noted that linear codes are frequently referenced by a triplet 
of parameters n, k, and d where an [n, k, d] code refers to an n-bit code with minimum 
distance d that encodes k logical bits. The two and three-bit repetition codes are 
[2, 1,2] and [3, 1,3] codes respectively. 



An [n,k,d] linear code forms a fc-dimensional linear subspace of the vector space 
and can, consequently, be specified by a set of k basis vectors or by a set of n — A; basis 
vectors of the orthogonal space. Let G he a. kxn matrix whose rows are independent 
codewords, and let H be an — A;) x n matrix whose rows are independent vectors 
orthogonal to every codeword. G and H are called the generator matrix and the 
parity-check matrix respectively and satisfy 



As is normal for binary linear codes, addition in Equation (2.53) is performed modulo 
2. Modular arithmetic engenders a strange kind of orthogonality; any binary vector 
of even weight, for example, is orthogonal to itself. Being bases, H and G are not 
unique, adding one row to another in either matrix yields an equivalent basis. 

The generator matrix provides a very efficient method of encoding logical infor- 
mation. Given an unencoded data vector d on Z2 the corresponding codeword d can 
be chosen to be 



Rather than enumerating 2^^ codewords and associating them with 2^ logical states, 
we enumerate a basis of k codewords and associate them with a basis of k logical 
states; linearity takes care of the rest. 

The parity-check matrix, by contrast, is most useful for what it tells us about 
states that are not in the code. The parity-check matrix provides an easy way to 



2.5.2 The Parity Check Matrix 



H G^ = 0. 



(2.53) 



d = d'^ -G. 



(2.54) 
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separate information about the codeword from information about the error. For any 
state a = d + e where d is a codeword and e is an error string 



since, by definition, codewords have zero inner product with H. The bit string 
H ■ e is known as the syndrome of the error e. The syndrome contains all available 
information about what error was present since any state with the same syndrome 
can differ from e only by a codeword, and codewords are undetectable errors. Each 
syndrome can thus be associated with an error, reducing the problem of syndrome 
decoding, that is, finding the most probable causative error, to a single call to a 2"^^^ 
entry look-up table. For minimum-distance error correction, the lowest-weight error 
with a particular syndrome is the one associated with it. 

The generator and parity-check matrices are also useful for constructing new 
codes, either from scratch or by modifying existing codes. Codes such as the Ham- 
ming codes and low-density-parity-check (a.k.a. Gallager) codes result from choosing 
parity-check matrices that satisfy certain properties. New codes can be generated 
from old by a variety of processes including extending, puncturing, concatenation, 
and taking the dual of the original code. Each of these topics are covered briefly 
below. 

The Hamming codes J^n are a family of [2' — 1,2' — 1 — /, 3] codes (/ > 1) 
invented by Richard Hamming [28]. The parity check matrix for a Hamming code 
is constructed by choosing the columns to be all non-zero binary strings of length 
/. This choice is desirable because it ensures that the resulting code has minimum 
distance d = 3. Thinking of error locations as selecting columns of H, the condition 
that an error is a codeword, and therefore undetectable, is that it selects columns of 
H that add to zero modulo 2. For the Hamming codes, states of weight 1 cannot be 
codewords because none of the columns is permitted to be the string of all zeroes. 
Nor can states of weight 2 be orthogonal to H; no two columns of H add to zero 




(2.55) 
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since the columns are distinct. H includes all non-zero strings as columns, however, 
so the sum of any two columns must also be a column, showing that d = 3. We 
have already seen a Hamming code. The three-bit repetition code introduced in the 
previous section is the Hamming code for / = 2. 

The term low-density-parity-check (LDPC) code refers to any code defined by a 
parity check matrix such that the number of O's in each row and column greatly ex- 
ceeds the number of I's. The initial work on LDPC codes by Gallager [20] concerned 
parity-check matrices with fixed row and column weights, though, at present, one or 
both of these constraints are frequently relaxed, with sparse random matrices being 
a popular choice for parity checks. The motivation behind this unusual construction 
is that large LDPC codes have the capacity to achieve very high encoding rates {k/n 
comparable to Shannon's limit [47, 32]) while still being efficiently decodable. The 
ability to efficiency decode syndrome information rapidly becomes important as the 
size of a code n increases. Nearly any randomly chosen linear code achieves the Shan- 
non limit, but the resources required to decode the syndrome for such a code scale 
exponentially in n. LDPC codes are interesting because there exists an algorithm, 
the behef propagation or sum-product algorithm, that permits them to be decoded 
approximately using only order n operations [32]. 

Any code can be extended by adding a column of O's and row of I's to the parity 
check matrix. If H is the n x m parity check matrix defining the initial code, then 
H' defines an extended code where 







if k = n + 1 and j < m 



H 



■/jk 



\ 1 



if j = m -|- 1 




H-''^ otherwise 



v 



Extending a code never reduces the distance since H' ■ e' = implies 




mod 2 



and 



H-e = 



(2.57) 
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where wt(e') is the weight of e' and ej = e'j for all j < n. In other words, an error e' 
that is undetectable by H' contains an error e that is undetectable by H. A similar 
argument shows that the distance is increased iff the distance of the original code 
is odd, as in the case of the Hamming codes. The extended Hamming codes have 
parameters [2', 2' — (Z + 1),4]; in a slight abuse of notation I subsume them under 
the label J^. 

A code is punctured by deleting one of its bits. In terms of the generator matrix 
G this is simply the deletion of a column. Clearly puncturing is likely to reduce 
the distance since deleting one non-zero bit of a weight d codeword yields a weight 
d—1 codeword. It is also clear, however, that puncturing cannot reduce the distance 
by more than 1. To express the action of puncturing the jth bit of a code using 
its parity-check matrix, it is helpful to first take products of rows such that the jth 
column of H contains at most one 1. Puncturing the code then deletes the row 
containing a 1 in column j and column j. 

A very different method of constructing new codes from old is concatenation. 
Given two codes and ^b, we can define a concatenated code ° consisting 
of the code ^b where each of the unencoded states composing ^b is replaced by the 
corresponding logical state (or codeword) of "t^a- If is an [n^, /ca, c^a] code and 
^B is an [ub, ks, cis] code then the code ° ^^a is an [n^ns, /cs, rfsd^] code. The 
distance rf^rfyi arises from the fact that an undetectable error on '^b^'^a corresponds 
to at least errors on the higher level code "^b each of which, again, if they are 
to be undetectable, correspond to at least errors in the underlying code ^a- 
The primary advantage of using a concatenated code instead of a larger single-layer 
(or block) code is that the difficulty of decoding grows slowly with the number of 
layers of concatenation. The ratio of distance to size, however, decays exponentially; 
concatenating ^a I times yields a code with distance to size of {dA/riAY ■ 

The dual of a code ^, denoted is the code corresponding to the linear 
subspace orthogonal to ^ . Thus, if G and H are the generator and parity check 
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[1 1] [1 1 1] 

(e) 

'10 10 10 1" 
110 11 
1 1 1 1 

1111111 

Figure 2.6: The generator matrix G and the parity check matrix H for a selection 
of codes. The matrices shown correspond to a) G = H for b) G for = Jif^, 
c) H for = ^3, d) H for e) G for and f) G = H for The three-bit 
repetition code ^3 is also the [3, 1, 3] Hamming code Jif^. The [7, 4, 3] Hamming code 
is dual containing while the two-bit repetition code ^2 and the [8, 4, 3] extended 
Hamming code Jifg are both self-dual codes. 

matrices of ^ then G' = H and H' = G are the generator and parity check matrices 
of 

The 7-bit Hamming code and its extension J^s crop up repeatedly through this 
this work. is what is known as a dual- containing code, meaning that Jif^-^ C 
or, equivalently, that the span of G contains the span of H. is self-dual, meaning 
that J^g^ = Jifs or, equivalently, that the span of G equals the span of H. Both J^-*- 
and J^s also have the property that the weight of all codewords is a multiple of 4, a 
property that will prove important in Section 2.6.2. The generator and parity check 
matrices of and J^s are shown in Figure 2.6 along with those for the repetition 
codes introduced in the previous section. 

2.5.3 Quantum Codes 

Quantum and classical coding are closely related, so much of the theory of classical 
coding can be grafted over to the quantum regime. The classical repetition code 
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^2, for example, can be used to define a quantum repetition code ^2 with encoded 

states 

|0) = |00) and |1) = |11). (2.58) 

Using this encoding, a single bit flip error X no longer interchanges logical states; 
instead, the operator X1X2 implements a logical bit flip X where, as before, the 
overhead line indicates a logical or encoded quantity. This code detects single bit- 
flip errors just like the classical code, but, in order to maintain superpositions, error 
checking must be performed in a particular way. Measuring the bit value, that is, 
measuring Z, of individual qubits of the encoded state is not an option since that 
would also measure the bit value of the encoded state. If, for example, the encoded 
state were initially 

|^> = ^(|0) + |i)) = ^(|00) + |ll)) (2.59) 

then after measuring Zi on the first qubit, which is equivalent to measuring it in the 
{|0), |1)} basis, the state would be 

,100) 50% of the time 
|V^'>=<| , (2.60) 

1 11) 50% of the time 

which is, whatever the result, completely classical. In order to detect errors without 
inducing collapse, we need to find a set of measurements that tells us nothing about 
the encoded state while still conveying all available error information. 

We saw just such a measurement in Section 2.5.2 where the syndrome obtained 
from the parity-check matrix was shown to contain error information exclusively. 
Physically, each row of the parity-check matrix corresponds to an operator in which 
the parity of a group of bits are measured. Classical bit measurements are equivalent 
to Z measurements quantum mechanically, so, for the example of measure- 
ment operator corresponding to the parity-check matrix [1 1] is ZxZ2- Measuring 
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Z1Z2 has no effect on tlie logical basis states since they are +1 eigenvectors of the 
operator, 



Z1Z2IOO) = |00) 



and 



Z1Z2III) = (-1)'|11) = 111). 



(2.61) 



Indeed, the check operator Z1Z2 completely defines our code, so that, for an initial 
state the post-measurement state l^^') is 




with probability yp 
with probability (i/j 



(2.62) 



where the outcome is specified by the measurement and U^x — |00)(00|-|-|11)(11| and 
n^x± — I — H^x are the projectors onto the codespace and the orthogonal space 
respectively. This illustrates another important part of quantum error correction, 
which is how coherent errors get dealt with. We might, after all, imagine errors that 
do not flip a bit entirely, but put it into a superposition of flipped and not flipped. A 
state initially prepared in {ip) = |0) = |00) might, for instance, suffer a coherent bit 
error such that the new state is \ip') = \J\ — a^lOO) + q;|01). Such errors look very 
worrisome because they appear to be continuous, and we are only extracting one bit 
of information. But after the measurement of Z^Z-i^ 



n 



1 00) with probability ('ip 



n 



n 



1 01) with probability (A) 



n 



(2.63) 



showing that the process of measuring whether there is a bit flip error or not projects 
1^') into either a state where there is a bit flip error or one where there is not. 



Along with superposition in quantum mechanics comes relative phase. Indeed, 
superposition is meaningless in the absence of well-deflned relative phases. Averaging 
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over all possible relative phases for a superposition state, say (|0) + e'^|l)) / \/2, yields 



1 

"47r 



27r 



PAVE=^ / rfe(|0)+e^^|l))((0|+e-'^l|) 







de + \i){i\ / do 



1, , , , 1, w , 



'° 2. 2. ^2.64) 

+ |0)(1| / dee-'^ + |1)(0| / dOe'' 
Jo Jo 

2' " ' 2' 

so a completely unknown relative phase looks identical to a classical mixture of 
the basis states. If we're going to store quantum data, we need to be able to detect 
phase as well as bit errors. As with bit errors, however, we don't need to worry about 
protecting against an arbitrary phase error; preventing sign-flip errors is sufficient. 
Sign-flip errors are generated by the operator Z, whose effect is clarified by shifting 
to a superposition basis such as |±). Z|±) = |=i=), thus, Z errors interchange the 
states 1+) and |— ) in just the same way that X errors interchange the states |0) 
and This similarity suggests that phase errors might be detected using a code 
^2 which has basis vectors |±) = |±±) and a logical sign fiip operator Z\Z2- As 
expected, a single sign fiip is detected by X1X2, the check operator for this code. 

Having learned how to correct both bit and phase errors, it only remains to 
combine these two skills so that we have a fully functional quantum code. Perhaps 
the most straightforward way to implement both bit and phase-error correction is to 
do them each separately. The code o detects X errors on the first level using 
Ml 

codestates) using ; its logical basis states are 



^ and detects Z errors one the second level (where the base components are M^ 



|±) = ^(|00)±|ll)r- (2.65) 

!^2°!^2 detects a single bit error and/or a single phase error. Quantum codes, such 
as this, that are created by concatenating a repetition code in the bases X and Z 
are called Shor codes after their inventor Peter Shor [48]. 
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Shor's original code was actually a concatenation of the three-bit repetition code, 
not the two-bit repetition code. The resulting 9-qubit code is capable of correcting 
one error and has the following logical basis states 

|0) = ^(|000) + |111))^' and |I) = ^ (|000) - . (2.66) 



Note that Shor's original [[9, 1,3]] code assigned the upper-level logical basis states 
differently than was done in the preceding paragraph. 

Finally, as with classical codes, a notation has been adopted for referring to 
quantum codes by a triplet of important parameters. The designation [[n, k, d]] is 
applied to a quantum code that encodes k qubits in n qubits such that the minimum 
distance (for any combination of X, Y, and Z errors) is d. 



2.5.4 Stabilizer Codes 



Though the previous section focused on states, quantum error correction, like the rest 
of quantum mechanics, can also be approached from the point of view of operators. 
In particular, the stabilizer state formalism [22, 13, 23] has proven to be a powerful 
tool for quantum coding. Section 2.3.2 showed that a set of n stabilizer generators 
specifies a unique quantum state on 7i". Similarly, a set of — /c stabilizers can be 
used to specify the codespace of a quantum code encoding k logical qubits on H"^. 
The logical states are specified implicitly by choosing an additional 2k Pauli operators 
that commute with the stabilizer of the code. These additional operators represent 
logical X and Z for each of the k encoded qubits and, as such, form a matched pair of 
stabilizers of the sort described in Section 2.3.3. The error correcting properties of the 
resultant code can be described completely in terms of the commutation properties 
of the stabilizer. 

Consider a code ^ whose logical states are simultaneous +1 eigenvectors of the 
set of m = n — k stabilizer generators {A^}. Because the are Pauli operators they 
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each either commute or anti-commute with other Pauh operators. Consequently, a 
logical state j-?/^) modified by a Pauli error E is, like I?/;) itself, an eigenvector of 
the stabilizer corresponding to a well defined set of measurement outcomes for the 
stabilizer generators A^, 

{ip\E^A'E\ip) = ±{'ip\\E\'^A'\i>) = ±{i>\A'\i>) = ±1. (2.67) 

The error E is detectable only if {i?,y4*} = for some A^. Together with the 
stabilizer generators, the logical Pauli operators provide a basis for the group of 
undetectable errors. Errors that are elements of the stabilizer group, A^, are harmless 
since A*]-?/') = I?/)), but the errors corresponding to logical operations irreparably 
damage the encoded state. 

As for classical error correction, a set of errors S = {E^} is distinguishable if no 
pair of them yield the same state, that is if E^'^E^ is detectable for all E"^, E^ G S. 
Most classical codes, however, fail to correct pairs of errors that they cannot dis- 
tinguish since applying the wrong correction completes a logical operation changing 
the encoded state. By contrast, quantum codes, like classical secret-sharing codes, 
utilize logical states that are unaffected (or stabilized) by certain non-trivial opera- 
tions. Application of a stabilizer has no effect on the logical state, so a set of errors 
S = {E^} is correctable if E^'^ E'' is either a stabilizer or a detectable operation for 
all E\ E^ e £. 

As can be seen most clearly by considering the binary representation, the sta- 
bilizer generator is closely related to the classical parity-check matrix. In addition 
to the independence requirements satisfied by the parity checks, however, the stabi- 
lizer generators must also satisfy the property of commutativity. In binary notation, 
commutativity equates to orthogonality under the symplectic inner product defined 
in Section 2.3.2, and the symplectic inner product also replaces the standard one in 
determining the detectable error set. The analogy between parity-check matrices and 
stabilizer generators is most apt when applied to the operators that generate errors 
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and logical gates rather than to states, reflecting the stronger distinction between 
states and operators in quantum coding. 

The quantum bit-flip (sign-flip) codes discussed earlier are stabilizer codes where 
the stabilizer generators are obtained from the rows of the parity-check matrix by 
placing an / in the generator at locations where there are O's in the corresponding row 
of the parity check matrix and a Z {X) where there are I's. The stabilizer generators 
of a concatenated code o follow simply from replacing the Pauli operators in 
the stabilizer generator of by the corresponding logical Pauli operators of and 
adding the stabilizer generators of on the flrst level groups of qubits. The need 
for this second step is perhaps most easily understood by thinking of the stabilizer 
generator as the generator of the identity elements of a code, in which case, the 
inclusion of the stabilizer generators of 'ipa in the stabilizer generators of ° 
amounts to the inclusion of the logical identity operations for the code ^a- 

Quantum codes based on classical binary constructions do not make full use of the 
stabilizer formalism. A quantum code need not correct X and Z errors separately. 
In fact, the smallest quantum error-correcting code, a [[5,1,3]] code known as the 
5-qubit code, does not have this property. Figure 2.7 shows the stabilizer generators 
for the 5-qubit code ^5 and the codes , and o covered in the last 
section. 

2.5.5 CSS Codes 

The Shor codes, introduced in Section 2.5.3, are constructed by concatenation of a 
quantum bit-flip code whose stabilizers contain only Z's and the quantum sign-flip 
code obtained by exchanging Z for X in the stabilizers of the bit-flip code. Another 
class of quantum codes that obeys a kind of X-Z symmetry is the Calderbank-Shor- 
Steane (CSS) codes [14, 51]. CSS codes are deflned as quantum codes for which there 
exists a set of stabilizer generators consisting of two sub-generators, one containing 
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Figure 2.7: Stabilizer generators for a) b) c) o and d) ^, 
and are simply the classical two-bit repetition code in the X and the Z bases. 
The stabilizers generators for ^ 
in the stabilizer generators of L 



are constructed by replacing the X operators 
hy X ® X^ the logical X operation for and 



adding the stabilizer generators of on the first level pairs of qubits. The 5-qubit 
code ^5 does not have such a simple relation to classical binary codes. 



H 








H 



only X-type Pauli operators and the other obtained from the first by exchanging 
X for Z . Using binary notation, this is the statement that the stabilizer generator 
matrix can be written in the form 

(2.68) 

where H ■ H""" = 0, ensuring commutativity. The class of CSS codes can be gener- 
alized by dropping the symmetry requirement while retaining the segregation of the 
generators. Generalized CSS codes permit independent detection of X and Z errors 
and have stabilizers whose generator matrices can be written in the form 



(2.69) 



where H ■ F""" = 0. Of the quantum codes we have encountered thus far, all but the 
5-qubit code are generalized CSS codes. 

The canonical, and original, example of a CSS code is the Steane code, a [[7, 1, 3]] 
code named in honor of its inventor, Andrew Steane [HO]. As shown in Figure 2.8, 
the binary matrix H for the Steane code is the parity check matrix for the classical 
Hamming code J^-j. The logical X and Z operators of the code are chosen to be 



H 








F 



X — X1X2X3X4X5X6X7 



and 



Z — ZxZ^Z'^Zi^Z^Z^Z'j^ 



(2.70) 
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Figure 2.8: A stabilizer generator for the Steane code in a) standard and b) binary 
form. 

making the code fully symmetric under the interchange of X and Z. The Steane code 
is the smallest of the error-correcting CSS codes and probably the most widely used 
of all quantum codes. Its popularity derives both from its size and the convenient 
properties of CSS codes for fault-tolerant quantum computing, a topic covered in 
Section 2.6. 

2.5.6 Non-Pauli Errors 

Up till now I have considered only a very limited set of errors, but the variety of things 
that might go wrong with a quantum system is great; an error might take the form 
of any quantum operation. As discussed Section 2.2, an arbitrary trace-preserving^ 
quantum operation £^ on a state p can be written as 

Sip) = J2E''pE-^ (2.71) 

a 

where the error operators E"" need only satisfy E"-'^ E"- = I. 

In the face of such a general error model, it would seem unlikely that correcting 

Pauli errors should be sufficient to guard quantum data. Recall from Section 2.3.1, 

however, that an arbitrary operator on x Ti" can be expanded in terms of elements 

"^Trace-decreasing errors that correspond to loss of the physical system, e.g. an ion 
falling out of a trap or decaying into an inaccessible state, are not treated here. Interested 
readers are referred to References [6, 29, 36, 40, 23]. 
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of the Hermitian n-qubit Pauli operators P". Consequently, each error operator can 
be written in terms of = {A^} as 

E- = J2c^a^A' (2.72) 

i 

where the complex coefficients. The action of £ can thus be expressed as 

£ip) = Y,aa^a:^A^pA^. (2.73) 

aij 

For the special case "^a^aiO^aj = lAP'^jj; Equation (2.73) reduces to 
S{p) = J2m'A'pA\ (2.74) 

i 

Equation (2.74) defines a multi-qubit stochastic Pauli channel, that is, a channel 
that can be interpreted as applying the Pauli operator A* to p with probability 
Stochastic Pauli channels differ from the more general case in that there are no 
coherences between Pauli errors. Either error A^ happens or it doesn't; the qubits 
never suffer from a superposition of errors A^ and A^ . Consequently, everything we 
have learned about error correction up till now applies. For p initially encoded in an 
[[n, k, d]] code each A* with weight less than d corresponds to a detectable error 
and each A^ with weight of at most t = [{d — 1)/2J corresponds to a correctable 
error. 

The error operation given by Equation (2.73) does entail coherences between Pauli 
errors, but these, like any other kind of coherence, are susceptible to destruction by 
measurement. Moreover, projective measurements of the stabilizer generators are 
well suited to orchestrating such a collapse since the measurements are designed to 
distinguish between different kinds of Pauli errors. Given a set of stabilizer generators 
Sg = {C*} for the code the projectors corresponding to each are (/ ± C*)/2. 
Thus, having projectively measured each C* and obtained the values q, the pre- 
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measurement state p' = S{p) is transformed to 



p oc 



k / aij \ I 




where C'^A^ = Xki^^C^. Since p is in the codespace stabihzed by {C^} the projectors 
(/ — C^)/2 annihilate p while the projectors (/ + C^)/2 have no effect, 

(2.76) 

Thus, the only terms in Equation (2.75) that survive are those with i and j such 
that Xhi = Xhj = Ch for all h, that is, those for which and have the measured 
syndrome {ch}- The post-measurement state is therefore 

P" « E E (2-77) 

where T = {i\AC'' = CkC^A \/k}. 

Depending on the error operation, measuring the stabilizer generators can some- 
times be sufficient to remove all harmful coherences. Let w be the weight such that 
w > max(wt(i?°)) for all error operators E"" of the quantum operation S, where 
wt(£'") refers to largest weight of any Pauli operator in the decomposition of -E". 

Error detection works flawlessly when w < d, since, by definition, all A^ such that 
wt(A*) < d and [A^, C''] = for all k are in Sg. In other words, any Pauli operator 
that commutes with all of the stabilizers and satisfies the weight restriction must be 
a stabilizer itself, and therefore harmless to the encoded state. 

Likewise, when w < t = \_{d—l)/2\ then error correction succeeds since A^A^ G Sg 
for all A^ and A^ with the same syndrome and satisfying wt(y4*), wt(y4-') < t. In 
words, any Pauli errors with the same syndrome and satisfying the weight restrictions 
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differ by at most an element of tlie stabilizer and are therefore equivalent for the 
purposes of error diagnosis. 

More typically, w is greater than the maximum error weight that the code can 
tolerate, so error management may not succeed perfectly. In this case, the probability 
of success can be quantified by the probability that a projective measurement on the 
actual state should find the ideal state. This metric is often called the fidelity (though 
so is its square root), and is defined as 

F{\e),cx) = {e\a\e) . (2.78) 

for a state |^) and density matrix a. Taking my initial state to be pure, p = 
and letting p'" denote the state of p" after error correction (if we're correcting), the 
probability of failing to obtain p'" = is given by 1 — F (|0i),p"'). Since p'" is 

normalized, however, 

1 = tr(p'") = J2 \P"'\ = mi),P"') + E \P"'\ "t"^) (2.79) 

h h^l 

where is an orthonormal basis. Applying Equation (2.79), the failure proba- 

bility is given by 

Vifailure) = 1 - F (|</)i), p'") = J] (0, \p"'\ 0,) . (2.80) 

Now let V be the maximum error weight that the code can tolerate, either d — 1 
or t depending on whether error correction or detection is being performed. The 
terms of p" that wind up contributing to the failure probability in Equation (2.80) 
are those for which both A* and have weight greater than v. This is because, for 
all other terms of p", the corrective action taken is appropriate for one of the two 
Pauli errors, thus yielding a term in p'" where one of the state vectors is and 
therefore orthogonal to all 

For independent, local errors with amplitude y/p, each non-identity element in 
a Pauli error brings an additional factor of ^Jp. Consequently, terms in p" with 
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wt(y4*), wt(y4-') > V occur with probability of at most order p^, showing that the 
failure probability for independent, local errors goes like, at worst, p^. 

2.6 Fault Tolerance 

Our discussion of error correction was founded upon the assumption that errors on 
different components are independent, and, thus, that the most likely errors are those 
of low weight, that is, those affecting few qubits. Errors are never perfectly isolated, 
however, so, even should the independence assumption hold for the creation of er- 
rors, there is nothing to guarantee that errors remain independent as a computation 
proceeds. Indeed, quantum computers are particularly prone to spreading errors. 
Contrary to the case of classical computing, where two-bit gates can be made essen- 
tially unidirectional, the unitary two-qubit gates employed for quantum computing 
are inherently bidirectional, that is, they transfer information, and therefore errors, 
between the qubits acted upon in both directions. However errors are generated, 
if they are permitted to spread indiscriminately the final error distribution will vi- 
olate the independence assumption necessary for making minimum distance error 
correction work. 

Fault-tolerant design [16, 49, 40, 2] is an approach to quantum computing that 
seeks to minimize the spread of errors. Formally we can define fault tolerance as 

Fault tolerance The property that errors affecting r < w components involved in 
an operation cannot result in more than r errors on a single encoded block. 

where w is some weight of interest, typically, the number of correctable errors. The 
focus on individual encoded blocks reflects the fact that encoded blocks are corrected 
independently. Spreading errors between blocks increases the frequency of errors on 
individual qubits of a block, but spreading errors within a block generates correlated 
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block errors from uncorrelated ones. 

To clarify the distinction, consider the case of error detection using the two- 
qubit quantum bit-flip code ■ Suppose that we have a logical qubit prepared in 
the state |0) = |00) that has suffered, with probability p, an X error on the first 
encoding qubit; the corresponding density operator is 

pi = (l-p)|00)(00|+p|10)(10|. (2.81) 

At worst. Pi has a single, detectable error; this occurs with probability p. Given a 
second copy of |0) whose second encoding qubit is similarly uncertain, the joint state 
of the four qubits can be written 

p =((1 -p)|00)(00| +p|10)(10|) (8) ((1 -p)|00)(00| +p|01)(01|) 

= (1 -p)'(|00) ® |00))((00| ® (00|) -j9)(|00) ® |01))((00| ® (01|) 

+ -p)(|10) |00))((10| ® (00|) +p'(|10) ® |01))((10| (g) (01|). (2.82) 

Now let a '~^X gate be apphed from the second qubit of the second code block to 
the second qubit of the first code block. The resulting state is 

P — -^22,12P -^22,12 

= (1 -j9)2(|00) ® |00))((00| (g) (00|) +p(l -p)(|01) ® |01))((01| ® (01|) 
+ p(l -p)(|10) ® |00))((10| ® (00|) +p'(|ll) ® |01))((11| ® (01|). (2.83) 

Tracing over the second logical qubit yields 
Pi = tr-2{p') 

= (1 -p)2|00)(00| +p(l -p)|01)(01| +p(l -p)|10)(10| 

(2.84) 

showing that, while the total probability of an X error on qubit 2 has increased to 
roughly p, errors on the two qubits can still be considered independent since the 
probability of an X error on both qubits is p^. 



Chapter 2. Background 



49 



By contrast, suppose that we had apphed a X gate between the first and second 
qubits of pi- The final state is then 



which suffers from two X errors with probabihty p. By applying the X gate between 
qubits in the same encoded block, we have converted a single, detectable bit error 
into a pair of bit errors that occur with the same probability and are undetectable. 

In addition to demonstrating the importance of fault tolerance, the preceding 
example suggests the most straightforward method of achieving it, which is through 
the avoidance of gates that couple qubits within the same encoded block. For some 
codes a surprising number of encoded operations can be applied in this manner. In 
this dissertation, I refer to all such operations as transversal, a designation differing 
from some of the literature, where the term transversal is assumed additionally to 
imply that the same component operation is applied to each qubit of a block. I use 
the term homogeneous is used to specify operations satisfying both properties. Sub- 
section 2.6.2 describes the encoded operations that can be implemented transversally 
for CSS codes. 

Performing encoded gates fault-tolerantly does not allow us to avoid error correc- 
tion; even errors that build nicely must be corrected periodically lest they become 
overwhelming. As explained in Section 2.5.3, however, quantum error correction re- 
quires the measurement of non-local check operators, the results of which are then 
used to infer the location of errors on the data. This introduces two possible paths 
by which errors might spread. The first is that the process of determining the value 
of the check operators, which necessarily involves combining information from mul- 
tiple qubits of the same encoded block, might allow the same error to leak into the 
data more than once. The second issue is that a single mistaken check operator can 
sometimes lead us to misinterpret the syndrome, thereby yielding many errors due 
to our "correction" of the data. The second of these problems is dealt with by simple 




(2.85) 
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repetition of the check operator measurements. The first can be avoided in a variety 
of clever ways explained in Subsection 2.6.4, all of which rely, at heart, on coupling 
the data qubits to an ancillary state with nice error properties. 

Ancillary states turn out to be quite useful for fault-tolerant quantum computing. 
Aside from their employment in error correction. Subsection 2.6.3 shows how, in 
conjunction with teleportation, ancillae can be used to fault-tolerantly implement 
encoded gates that cannot be performed transversally. 

Ancillary states are not free, of course, so a full fault-tolerant scheme must also 
include a method of constructing the necessary ancillae. Typically, ancillae are con- 
structed in ways that are not at all fault-tolerant but are subsequently verified to 
ensure that the states are of adequate quality. Subsection 2.6.5 covers the standard 
approach to ancilla construction while Chapter 5 presents a method of my own. 

2.6.1 Pauli-error Propagation 

Before delving into the intricacies of fault tolerance, it is probably useful to introduce 
the notion of error propagation. Pauli-error propagation relies on the fact, mentioned 
in Section 2.3.1, that the Pauli group is invariant under conjugation by Clifford gates. 
This implies that any string of Pauli gates followed by a Clifford gate is equivalent 
to the same Clifford gate followed by some (possibly different) string of Pauli gates. 
Consequently, it is possible to shuffle Pauli errors to the end of a Clifford circuit, 
thus yielding a perfect circuit modified by the resultant terminal Pauli operators 
(see Figure 2.9). Moreover, for a circuit built up from gates in Cg = {H, P,^X}, it 
is possible to perform this error propagation in a computationally efficient way by 
sequential application of a few of the circuit identities defined in Figure 2.4. 

Error propagation is more generally applicable than one might imagine. As sub- 
sequent sections show, fault-tolerant procedures based on CSS codes need never 
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Figure 2.9: An example of Pauli propagation. Equality is up to an overall phase. 



apply non-Clifford gates to the data qubits, so error propagation can typically be 
used to determine the impact of a Pauli error at any later point in a fault-tolerant 
computation. Moreover, as discussed in Section 2.7.4, Pauli-errors are frequently an 
acceptable substitute for more general errors and, even when this is not the case, 
they still provide a basis for any possible error (see Section 2.3.1) whose impact can 
thus be gauged by considering the evolution of this basis. 

Knill [30, 29] has pointed out an additional benefit of applying only Clifford gates 
the data, which is that Pauli gates need never be applied. Rather than applying Pauli 
gates, it is sufficient to determine by propagation the effect that they would have 
on subsequent measurements and to adjust the measurement outcomes accordingly. 
This property is put to good use in Chapter 4. 



2.6.2 Transversal Gates 



The standard method of showing that a transversal operation implements some en- 
coded gate is to demonstrate that it has the desired effect on an arbitrary logical 
state. This might be called the Schrodinger method of gate verification, since it 
concerns itself with transformations of the state. Instead, I take here an operator- 
oriented approach to gate verification. The two are equivalent since the requirement 
that encoded and unencoded measurement statistics match (which is all we really 
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care about in the end), 

{^\O^MO\^/j) = (iplO^MOlilj) , (2.86) 

can be verified either by showing that encoded 0\ip) is 0|?/') or by showing that 
encoded O^MO is O^MO. This second problem is made tractable by the fact, 
discussed in Subsection 2.3.1, that the Pauli operators are a basis for all Hermitian 
operators. For encoded unitaries, since Y = —iZX and OZXO'' = OZO'^OXO\ it 
is sufficient to check the transformations of M = J, X, and Z. The operator I is 
retained here because I use it to represent not simply the unencoded identity but the 
entire stabilizer group and any convex combination of the elements thereof. 

CSS codes are particularly well suited to transversal encoded gate constructions. 
The following paragraphs cover, in quick succession, the implementation of various 
transversal gates on an [[n, 1, c/]] CSS code for n G Odd where the logical X and 
Z operators have been chosen such that X = X®" and Z = Z®"^ . For the purposes 
of my analysis, it is convenient to choose a set of generators Sg of the stabilizer of 
such that Sg partitions into two subsets Sg and Sg , where Sg contains only X-type 
Pauli operators and Sg becomes Sg under exchange of X and Z. 

Destructive measurement need not really be transversal, but the destructive mea- 
surement of any transversal operator can, of course, be performed transversally by 
measuring each of the component operators and taking the product of the measure- 
ment outcomes. 



Let R = E® 
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(2.87) 



which verifies this choice of encoded H . This form of H follows simply from the X-Z 
symmetry of CSS codes and, consequently, applies to any fully X-Z symmetric code. 
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Let = {^Xf\ and thus, %, = (g)^! %h,-kh- 

^jk^k ^jk - ^jk^gk ^jk - ^gk 

C-y '7 C-y '7 oZiCy qZi ( r\ qq\ 

Jijk^j ^jk — ^jk^g-j ^jk — Og-- 

Cy ry Cy 7 7 Q^^^'V c^'i' 

^jk^k ^jk — ^j^k ^jk'^gk — '^g'j'^gki 

which verifies this choice of encoded ^X. Since Equation (2.88) utihzes only the 
segregation of X and Z operators, this form of ^X actually applies to general CSS 
codes as well. 

Not all CSS codes possess a transversal implementation of the phase gate, P. 
There exists a subset of CSS codes, however, those codes with n = 8/ — 1 where 
/ G Z+ constructed from doubly-even dual-containing classical codes^, that satisfy 
the property that the number of X and Y operators in any stabilizer is a multiple of 
4 and likewise for Z and Y operators. For such codes, the logical phase gate is given 
by P = (P"'')®", as can be seen below. 



pxpt 


= (P^Y 




= {iZX)®'' = 
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(2.89) 



where g E N. The addition of a transversal implementation of logical P to that for 
logical H and logical completes a transversal basis for encoded Clifford operations. 
It is notable that the Steane code permits such an implementation of P. 

Given a transversal basis for encoded Clifford operations, the addition of any 
transversal non-Clifford encoded gate would complete a universal set of transversal 
encoded operations on It is tempting, therefore, to seek such an implementation 



code is called doubly even if the weight of each of its codewords is a multiple of 4. 
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of, for example, the 7r/4-rotation, T. Because T is not a Clifford gate, however, it is 
ill suited to my Heisenberg analysis; conjugating a Pauli operator by T®" generates 
sums of Pauli operators. If, instead, we were to fall back on the Schrodinger approach 
to encoded gate verification, we would find that a simple, transversal implementation 
of logical T existed for CSS codes constructed from quadruply-even dual- containing 
classical codes. Unfortunately, quadruply-even dual-containing classical codes do not 
exist (see Chapter 19 of Reference [31]), and there is reason, having to do with the 
interaction between error correction and logical operators, to believe that transversal, 
universal encoded gate sets do not exist for non-trivial quantum codes. To complete 
an encoded gate set, it is necessary to introduce auxihary systems prepared in logical 
states that cannot be constructed using transversal encoded gates. 

2.6.3 Non-transversal Gates 

Transversal encoded operations plus the logical state |0) are not sufficient to imple- 
ment an arbitrary encoded operation. Universality is obtained, while satisfying the 
constraints of fault tolerance, by introducing the encoded form of some state such as 



The usefulness of this state derives from its ability to serve as a proxy for the T gate, 
as is shown below. 

The circuit 



e-/^> = i=(|0)+e^-/^|l)). 



(2.90) 




(2.91) 



transfers the state \ip) = a|0) -|- /3|1) from the first to the second qubit. This can be 
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seen from the action of the '"X gate 



^21 



+) = ^X2,(a|0)+/3|l))^(|0) + |l)) 



1 

1 

1 

71 



((«|0)+/3|l))|0) + (a|l)+/3|0))|l)) 
(|0)(«|0)+/3|l)) + |l)(a|l)+/5|0)))' 

(|0)|^) + |1)X2|^)) 



(2.92) 



after measurement and the conditional correction, the state of the second qubit is 
lip). We might, therefore, apply the 7r/4 rotation T to the state \ip) by applying 
the circuit in Equation (2.92) and subsequently applying T to the output. This 
roundabout application is interesting because of the way that it can be transformed 
by circuit identities. 
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(2.93) 



The final circuit of Equation (2.93) is composed entirely of Clifford gates. Thus, 
since T together with the Clifford gates constitutes a universal gate set, the ability 
to apply Clifford gates and to make the state |e"^/^) is sufficient for universal quantum 
computation. In terms of fault tolerance, this allows us to transmute the problem 
of non-transversal gates into a new problem, that of preparing encoded ancillae ['2o]. 
The details of ancilla preparation are discussed in Section 2.6.5. 



2.6.4 Error Correction 

Error correction entails projectively measuring non-local operators on the data qubits. 
There are a variety of way to do this. For any Hermitian operator M with eigenvalues 
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±1, the circuit 



|0) - H — H (2.94) 



M 



performs a projective measurement of M on the state {tp). This follows from the 
action of the unitaries 

i/i^Mi2ifi|0)|^) = ifi^Mi2-^(|0) + 

= H,^i\om + \i)M,m 

= ^((|0) + |1))|^) + (|0)-|1))M2|^)) (2.95) 

, , (/2 + M2) , , , , (I2 - M2) , , 
= |0) o V ) + |1) 



and the fact that the projectors onto the ±1 eigenspaces are (/±M)/2 for an operator 
M with only ±1 eigenvalues. The same circuit works for an w-qubit measurement 
M with eigenvalues ±1, in which case the control gate becomes M, a multi-qubit 
operation controlled by a single qubit. When M decomposes into a tensor product 
of single qubit operators with w non-trivial elements, 

w 

M = l[Ml (2.96) 

i=l 

(J — 

M decomposes into w two-qubit controlled-gates 



^M,., = n''(M^),>., (2.97) 

i=l 



and, for M G P", M can be implemented using Clifford gates exclusively. 

The problem with this form of measurement is that it is not fault tolerant. Imag- 
ine, for instance, measuring X^X^X^Xon the four-qubit error-detection code 
Ii%2 ° ^2 ■ Figure 2.10 shows that a single Z error on the ancilla qubit halfway 
through the measurement results in two errors on the data qubits. This is possible 
because the measurement circuit couples the same ancillary qubit to multiple data 
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Figure 2.10: Error propagation in a non-fault-tolerant circuit for measuring 
XiX^X^Xj. Circuit identities show that a single X error occurring on the ancil- 
lary qubit halfway through the measurement generates two X errors on the data. 



qubits. The problem can be avoided in at least three different ways through the use 
of more complex ancillae. 

The first, and most broadly applicable, fault-tolerant method of performing syn- 
drome extraction was developed by Peter Shor [ ]. Figure 2.11a demonstrates 
Shor-style measurement of ZiZ^Z^Z^ for the Steane code. Compared to other fault- 
tolerant methods of syndrome extraction, Shor's method requires the smallest ancil- 
lae. No assumptions about the nature of the code are required, though the measure- 
ment operator must be transversal and have eigenvalues ±1. 

Andrew Steane developed [ i] a method of syndrome extraction requiring fewer 
ancilla qubits and fewer gates to be applied to the data, but at the cost of requiring 
larger, more complex ancillae. The process of Z error extraction {X generator mea- 
surement) using Steane's approach is illustrated for the Steane code in Figure 2.11b. 
Steane's method requires that the data be encoded using a generalized CSS code. 
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Knill's method of syndrome extraction [29] is really less about extracting the 
syndrome than about determining the encoded operation that must be applied to 
complete teleportation of the data from one location to another. Figure 2.11c shows 
Knill's method for the Steane code. It requires larger ancillae than either of the other 
two methods, and the ancillae must not have correlated errors of any type, but the 
number of gates applied to the data is reduced to a bare minimum, as are the effects 
of gate errors. No additional assumptions about the nature of the code are necessary 
over those of Steane's method. 

The next three sections provide detailed information on each of these methods of 
syndrome extraction. 



Shor's Method 



Shor's method [49] replaces the control qubit of the projective measurement circuit 
with a block of qubits prepared in the logical state 

1+) = ^(|0)^'" + |1)^"') (2.98) 
v2 

of the w-qubit bit-flip code , a state better known as the w-qubit cat state. 
For measurement operators of the form given in Equation (2.96), this permits the 
transversal application of since 



w 

4n>0i,,„^iio)^^^-^^^(io)+ii))iv^>, 

i=l ^ 

= El^{\Or\^P)-^ + \lf-M,\^p)^) 
where E is the encoding unitary that takes \0)'^^^"^^ \(f)) to the state 



(2.99) 



encoded in 



. The encoding unitary E is not fault tolerant, so the cat state must by prepared 



Chapter 2. Background 



59 



separately and tested for X errors {Z errors only affect the measurement outcome, 
not the data) prior to coupling it to the data. The decoding unitary typically does 
not appear in the check measurement circuit either since decoding the ancilla and 
then measuring the output is equivalent to measuring each qubit in the |±) basis and 
interpreting the parity of the measured states as the outcome of the check operator 
measurement. This works because 

1 (lO)®- + \if^) = 2-(-+i)/2 J2 \a) and 

aeEven 

1 (10)®" - llf") = 2-('"+i)/2 J2 \a). (2.100) 

aGOdd 

While fault tolerant, however, the outcome of such a measurement is not partic- 
ularly reliable; a single error at any of a variety of locations results in a mistaken 
syndrome bit. Thus, before any errors can be corrected, it is necessary to repeat 
the check measurement until a consecutive sequence of c agreeing results are ob- 
tained where c is equal to the lesser of the number of correctable errors plus one and 
the maximum number of errors that might be generated by accepting an inaccurate 
measurement. 



Steane's Method 



Steane's method [52] takes a very different approach to extracting syndrome infor- 
mation. Rather than measuring each check operator individually, the existence of a 
set of segregated generators (into X-type and Z-type) for CSS codes is exploited to 
divide measurement of the entire syndrome into two phases, one where all Z error 
information is copied to an ancillary logical basis state, and one where all X error 
information is copied. The logical circuit for extracting X error information from 
the state |'?/') is simply 
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(2.101) 
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Figure 2.11: Measurement of the Steane-code stabilizer XiX^X^Xj using a) Shor's, 
b) Steane's, and c) Knill's method of syndrome extraction. Shor's method is the most 
widely applicable and necessitates the smallest ancillae. Steane's method applies to 
all CSS codes and extracts all syndromes corresponding to a single kind of stabilizer 
[X or Z) at once. Knill's method requires the largest ancillae, but it extracts all X 
and Z syndromes while minimizing the number of gates applied to the data. Given 
ancillae with uncorrelated errors, each of these methods is fault-tolerant. 
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That this circuit has no effect on the logical state of the data follows from the first 
form of Equation (2.101), in absence of error, the initial '"Z does nothing since the 
upper logical qubit is in the state |0). To see why it extracts X error information, 
consider instead the data state A^ijj'^ where A is an X-type Pauli error. Using circuit 
identities, 
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h- 




- A 
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1+) - A 



A 



A 



(2.102) 

Thus, X errors are copied to the ancillary state where they result in measurement 
errors. The location of those errors can be identified using the classical code cor- 
responding to the Z check operators. Just as X errors in the data migrate to the 
ancilla, however, Z errors may enter the data from the ancilla, so it is necessary to 
check the ancillary logical qubit very carefully for Z errors prior to use. Extraction 
of Z errors from the data proceeds in an identical manner via the logical circuit 



|0) 



H 



(2.103) 



Given ancillae without correlated errors, the demands of fault tolerance are sat- 
isfied by a single application of Steane's method of syndrome extraction. So long 
as the number of errors in the circuit does not exceed t, the number of errors cor- 
rectable, the measurement can be decoded correctly to reveal location of X errors on 
the ancilla. In such a situation, no error can reach more than one data qubit since 
data qubits are not coupled and mistaking the value of a single ancilla qubit only 
results in a mistaken recovery operator on the associated data qubit. 

While unnecessary for fault tolerance, however. Chapter 4 shows that multiple 
syndrome extractions can improve performance even when no correlations exist be- 
tween errors on component qubits of ancillae. Moreover, for correlated errors of the 
type that do not propagate to the data (the other kind are unsalvageable) verification 
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of syndrome information by multiple extractions is an absolute necessity. A typical 
approach is to repeat the syndrome extraction until t + 1 consecutive measurements 
agree. For [[n, k, 2t + 1]] codes with n and t large this becomes impractical since 
extractions will almost never agree. The appropriate number of repetitions then 
depends on the details of the error model describing the ancillae. 



Knill's Method 



Like Steane's method, Knill's method [ ] of fault-tolerant syndrome extraction relies 
on the ability to segregate the stabilizer generators and the availability of ancillary 
states encoded in the same quantum code as the data. All X- and Z-error information 
is extracted at once as the side effect of teleporting the logical state to a new location. 
The basic logical circuit for this method is 



|oo)+in) 
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(2.104) 



Examining the effect of the unencoded form of the first circuit for = a\0) + (3\1) 



H^''X32^{\00) + \ll)){a\0)+(3\l)) 



V2 
1 



HsiailOOO) + |110)) +/5(|011) + |101))) 



a(|000) + |001) + |110) + |111)) 



(2.105) 



+ /?(|010) - |011) + |100) - |101))) 
= (a|0)+/3|l))|00) + (a|0)-/3|l))|01) 

+ («|1)+/?|0))|10) + («|1)-/5|0))|11) 

= iv)ioo) + z.mol) + x.mio) + x,z,moo) 

we see that, modulo a corrective Pauli operator that depends on the outcome of the 
measurements, \ip) is teleported from the last to the first qubit. The logical circuit 
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has precisely the same effect on logical states, but, just as in Steane's method, some 
(in this case the X) error information is transferred by the encoded to another 
logical qubit. The final measurements thus reveal both the X-error locations and 
the Z-error locations, both mixed with the errors from the logical ancilla qubit. This 
information is useful, however, only in that it comes in the process of decoding the 
logical measurement value. The logical state no longer needs to be corrected for 
the discovered errors; it has been teleported clear of them. In effect, Knill's method 
exchanges the error distribution of an encoded data block for that of one half of an 
ancilla prepared in a logical Bell state. Failure occurs only when too many errors 
are present in the measurement outcomes to correctly identify the encoded Pauli 
operator needed to complete the teleportation. 

By its nature, Knill's method of syndrome extraction is not verifiable; the original 
logical qubit is obliterated by the process. It is therefore fortunate that, like Steane's 
method, it is fault tolerant even without repetition. As in Steane's method, however, 
this only holds for ancillae with uncorrelated errors; correlated ancilla errors are fatal. 

A convenient feature of this form of syndrome extraction and correction is that 
encoded single-qubit gates can be accomplished by performing the teleportation using 
a logical Bell state^ prepared with the desired gate already apphed to one half. 

2.6.5 Ancilla Preparation 

By now it should be clear that ancilla preparation is a crucial part of fault tolerance. 
Ancillae in non-trivial states are the means by which a variety of non-fault-tolerant 
operations are made fault tolerant. Such ancillae are not part of our assumed re- 
sources, however, so, in order to be complete, a procedure for fault-tolerant quantum 
computing must also include a method of making the requisite ancillae. 

^If desired, multi-qubit gates can be performed using larger and more complicated 
ancillae. 
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Useful ancillary states tend to be complex, highly-entangled objects. By defini- 
tion, entanglement cannot be generated between two systems without allowing them 
to interact, so fault- tolerant ancilla construction cannot simply rely on avoidance 
of couplings within an encoded block. Instead, ancillae are typically constructed 
in ways that are not particularly resistant to errors, and fault-tolerance is achieved 
through verification and the discard of dubious ancillae. Full verification is possi- 
ble since the exact target state is known, and both the abihty to discard suspect 
states and the low error rates assumed for bare qubits improve the output of state 
construction. 

In this section 1 discuss ways of constructing the ancillae that I have used up till 
now which are appropriate for codes on small numbers of qubits. 



Cat states 

I begin with an explanation of cat state construction, both because it is the simplest 
of the construction routines, and because cat states are a kind of primordial ancilla 

employed in the construction of many other ancillary states. In the absence of errors, 
cat states are easy to construct; the unitary 



H (2.106) 

i=w—2 

for example, transforms the state 10)®*" to an w-qubit cat state since 

n %,+i//iio)^-= n %m-^(io>+ii))io)^(-^) 

i=w—2 i=w—2 ^ 

2 



n %m-^(|00) + |11))|0)^("'-^) = . . . = ^{\0f- + 

i=w—2 ^ ^ 

(2.107) 
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If the jth X should fail in such a way that an X error is generated on the target, 
however, the state produced would instead be 

1 (|Q^®i|^^0(--i) ^ |i)®i|o)«(— J-)). (2.108) 
v2 

Employing such a damaged ancillae in, for example, Shor-style syndrome extraction 
would result in min(j, w — j) X errors on the data, all from a single failed gate. 

To verify cat states against X errors we measure random Z stabilizers ZiZj 
discarding the state if an error is found [ i ]. Two-qubit stabilizer measurements are 
easily performed fault-tolerantly using the unitary given in Equation (2.97). (This 
is what makes the Bacon-Shor codes [ ] so interesting.) Ignoring uninvolved qubits, 
the circuit for measuring the stabilizer A® B is 



\ 


> 




















1 


3 









(2.109) 



This circuit is fault-tolerant since any single error affecting both data qubits must 
occur before the application of '"B and therefore must be transmitted to the second 
data qubit by the '"B gate, implying that the error has the form E ^ B, but A® B 
is a stabilizer, so E ® B is equivalent to [A ® B) ■ {E ® B) = {AE) /, which is a 
single qubit error. 

In general, cat state verification requires measuring many Z stabilizers. For the 
case of a ti'-qubit cat state prepared using the unitary in Equation (2.106) and utilized 
with an [[n, k,d = 3]] code, however, it is sufficient to measure ZiZ^ since any single 
X error that spreads will continue to spread all the way to the last qubit. 

Logical Clifford states 

After the cat states, the most commonly used ancillae are phaseless logical Clifford 
states such as |0), |+), and (|00) + \ll))/\/2. Since the encoded '"X and H gates are 
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transversal for CSS codes, the ability to prepare any of these states fault-tolerantly 
implies the ability to prepare the rest. In this section I discuss the preparation of 

!+)• 

Let G and H be the generator matrix and the parity check matrix of the classical 
code associated with a CSS code If the logical X operators of ^„ are all X-type 
Pauli operators then the stabilizer generator 



(2.110) 



specifies the logical eigenstate |+). From Section 2.3.2 we know how to express a 
stabilizer state in terms of an equally weighted sum over all stabilizers applied to 
some state, 



G 








H 



1+) oc 2-" Yl ^lO)""" = 2"" n + 

Ce-S DGSg 



0n 



(2.111) 



For a stabilizer with segregated generators, like that in Equation (2.110) 
1+) oc2-"5^C|0)®" = 2-" 5^ J2 ABlOf" 

= 2-^ Yl ^1°)''" = 2-("+'=)/2 Y A|0)®" 



c&s 



(2.112) 



which shows that |+) is just an equally weighted superposition over all the bit strings 
in G. To create and verify this superposition, it is helpful to switch to a generator 
matrix G that is row reduced. For simplicity I assume that we can do this without 
swapping qubits. Let be the set of qubits {j\j > i and G*-' = 1}, the unitary 



in+k)/2 

n n% ^^ 



(2.113) 



applied to |0) " produces |+). To see why, consider the i = 1 term of Equa- 
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tion (2.113). The effect of this unitary on 10)^" is to produce 

(2.114) 

that is, it apphes the operator (/ + Sg^) to the state |0)®", where Sg is the X- 
stabihzer generator corresponding to G. Since G is row reduced, the control qubits 
for the other values of i are untouched by this process, so everything works the same 
for ^ = 2, etc., and the effect of the entire unitary on |0)®" is to produce 

2-(n+fc)/2 -Q (j + /^)|o)«« = 2-("+'=)/2 J2 AlOf" = (2.115) 

This preparation procedure for |+) is not fault tolerant, however, so the state must 
be checked for errors. To check for Z errors the X-stabilizer generators are measured 
using Shor's syndrome extraction method. Any state which passes all checks (failures 
are typically discarded) has been verified sufficiently for use in Steane's method of 
syndrome extraction. For other uses, X errors must also be tested for, either using 
Shor's method or by Steane-style syndrome extraction. The logical circuit for a single 
round of verification of |+) using Steane's method is 



1+) - 
1+) 



(2.116) 



Figure 2.12 shows explicitly the circuits involved in constructing |+) for the Steane 
code and checking it for Z errors. 



The 7r/4 state 



The state e^'^/^j was used in Section 2.6.3 to complete our encoded gate set, but 
none of the construction methods yet covered applies to this state. In this section I 
remedy that oversight by presenting two different methods of constructing e™/^ 
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Constructing the tt/A state through measurement 

One way that we have not yet used to prepare logical states is through measure- 
ment. Projective measurement leaves behind the eigenstate corresponding to the 
measured eigenvalue, effectively extracting an eigenvector. Thus, we can probabilis- 
tically prepare the eigenstates of any operator that we can measure, providing only 
that our initial state is not orthogonal to the desired state. To use this fact to pre- 
pare |e"^/^), we need to measure an operator with eigenvectors |±e"^/'^). As shown 
below, e~'''/'^PX has just these eigenvalues. 



e-W4px|±e'"/^) = e-'"/^PX^(|0) ±e^"/^|l)) 

v2 

v2 



-i7r/4 



V2 



(i|l)±e^-/^|0)). 



±i=(|0)±e'-/^|l)) = ±|±e-/^> 



Moreover, the state |0) can be decomposed as 
|0) 



|-e'-/^». 



(2.117) 



(2.118) 



so a projective measurement of e "^/^PX on |0) yields the state je^'^/^) and the state 
|— e™/^) with equal probability. 



From the preceding paragraph it is clear that one method of preparing 



e'-^/^ ) is to 



measure the encoded operator e~"^/^PX on the logical state |0), either trying again 
or applying Z when the outcome is —1. Implementing the encoded measurement 
circuit 



|0) 
|0) 



H 



H 



|0) 
|0) 



H 



Tt 



H 



(2.119) 



PX 



however, requires the encoded T"^ gate, and the entire point of this exercise is to allow 
us to implement T. Taking our cue from Section 2.6.4 we replace the control qubit 
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1+) ^ 
|0) 



H 



PX 



+ 



|0) 



1 



H 



(Ptx) 



0n 



|0) 



H 



-i7r/4 



X) 



1 + ) 

|0) 



Figure 2.13: A string of circuit identities decomposing a logical measurement circuit 
for e~''^/^PX into one involving only single qubit gates. These equivalences make 
use of the facts P = pt®"^ e^^^TXT'^ = PX, and T = Ti and the equivalence for 
codestates of the repetition code (see Section 2.6.5) of transversal Hadamard followed 
by transversal measurement to logical Hadamard followed by logical measurement. 
As before, the tilde overtop states indicates they are encoded using the repetition 
code. 



with a logical qubit prepared, not in the code of interest, but in the n-qubit repetition 
code. For CSS codes such that P = pt®*^^ ^j-^e resulting encoded measurement circuit 
can be reexpressed in terms of unencoded gates exclusively, as shown in Figure 2.13. 

As with other Shor-style measurements, the encoded measurement presented here 
is not robust, necessitating t+1 extractions for a t-error-correcting code. The problem 
is more dire here, however, since a single error on the data can change every measure- 
ment result, causing us to mistakenly identify the prepared state. To guard against 
this possibility, error detection must be performed between the measurements. 

Constructing the ir/A state through teleportation (State Injection) 

We saw in Section 2.6.4 how an arbitrary state could be teleported using an ancilla 

prepared in the Bell state |/3oo)- Teleportation is achieved by performing a measure- 
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ment in the Bell basis on the qubit to be teleported and one of the qubits of the 
Bell state. Modulo possible corrections indicated by the measurement results, this 
operation transfers the state of interest to the unmeasured qubit of the Bell pair. 

For this section, the crucial fact to note about the teleportation process is that 
the two halves of the Bell pair do not need to be encoded using the same code. Thus, 
the state (|00) + \ll))/^/2 can be used to teleport an arbitrary unencoded state to 
the encoded qubit of the Bell pair. The relevant circuit diagram is 



H 



|oo)+in) 

V2 



X 



(2.120) 



Z 



where P is a logical decoder. The teleported state must subsequently be verified 
since a single error either during the decoding process or on the unencoded state 
can result in a logical error on the encoded state, but verification is feasible since 
the correct state is produced with probability near 1. This method of logical state 
preparation, known as state injection [2')], can be applied for any unencoded state, 
including the state je^'^/^). 

The techniques required for state injection have already been introduced. Earlier 
in Section 2.6.5, a means of constructing encoded |+) was described. As mentioned 
there, the transversal gate implementations of H and '"X given in Section 2.6.2 per- 
mit the fault-tolerant construction of encoded (|00) + |ii))/v^ from the state |+). 
Decoding of the second logical qubit in the encoded Bell pair can be accomplished by 
reversing an encoding process similar (though not quite identical since care must be 
taken to design an encoder, minus verification procedures, that works for arbitrary 
input states) to that presented for |+). A single round of verification of e"^/"^ ) can be 
implemented using a second copy of 



e"^/^y and a modified form of the logical mea- 
surement procedure (no cat states required) described for the previously presented 
method of preparing e>'"l'^ 
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2.7 Thresholds 

No matter how skillfully designed the fault-tolerant procedure, there remains for any 
finite code afflicted by independent errors a nonzero probability that too many errors 
will occur in a computational step and our data will become irreparably corrupted. 
Consequently, as the length of the computation increases, the probability of failure 
approaches one. To ensure that we can perform computations of arbitrary length, 
we need a way of making the probability of an uncorrectable set of unencoded errors, 
i.e., an encoded error, arbitrarily small. 

Currently, the only known, viable method of achieving an arbitrarily low en- 
coded error rate is by concatenation of fault-tolerant procedures. As explained in 
Section 2.6, a fault-tolerant procedure replaces the states and gates in an unencoded 
circuit with encoded versions satisfying certain desirable restrictions with regard to 
errors. The resulting circuit implements the unencoded circuit in an encoded sub- 
space where qubits and operations display different error properties. Concatenation 
of fault-tolerant procedures works in roughly the same way as code concatenation; 
the encoded states and gates of a fault-tolerant procedure are used as the "unen- 
coded" states and gates for the subsequent level of encoding. The basic idea behind 
this process is the following: "If encoding qubits and gates reduces the effective error 
rate then encoding the encoded qubits and gates should reduce the error rate even 
more." This provides us with a plausible sounding way of achieving an arbitrarily 
small error rate; we simply add layers of encoding until the error rate is acceptable. 

But encoding will not always decrease the error rate. It is possible for hardware 
to be so error prone that the process of applying an encoded gate and error cor- 
recting is less likely to succeed than simply applying the unencoded gate. From this 
observation arises the idea of a threshold error probability pth for quantum computa- 
tion. The threshold is the unencoded error probability below which we can achieve 
an arbitrarily low encoded error probability using a number of qubits that scales 
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polynomially in the size of the problem. Put another way, it is the error probabihty 
below which we can compute indefinitely. 

Determining the threshold exactly for a given set of assumptions has proven to be 
a hard problem, but we can get some idea of its value through bounds and estimates. 

2.7.1 Threshold Estimates 

Estimates of the threshold for quantum computation are generally made by ana- 
lyzing a particular fault-tolerant implementation using a specific finite code under 
concatenation. Fundamentally, these estimates derive from the idea that encoding is 
undesirable if 

r Encoded ^ f Unencoded 

I error rate J I error rate 
Intuitively, this makes sense; we would not expect error correction to be advanta- 
geous when an encoded gate or qubit is more likely to fail than an unencoded one. 
Equation (2.121) harbors some ambiguity since there are many sorts of errors, and 
it might well be the case that the encoded error rate increases for some of them, 
but not all. That ambiguity can be resolved either by specifying that the inequality 
holds for the lowest encoded error rate and the highest unencoded error rate, or, for 
a finite set of possible errors, by specifying that it holds for each kind of error. 

Putting aside the matter of diverse error species, however, consider the implica- 
tions of Equation (2.121). Showing that a fault-tolerant procedure satisfies Equa- 
tion (2.121) does not provide an upper bound on the threshold for quantum comput- 
ing since some other procedure might perform better given the same unencoded error 
model. Nor does proving the opposite provide a lower bound. While an increase in 
the error rate at the first level of concatenation implies that subsequent layers of 
concatenation also increase the error rate, the converse is not true. To see why, 
imagine that we have some fault-tolerant procedure for which the encoded failure 



(2.121) 
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rate is less than the unencoded failure rate. At the first level our code is constructed 
of unencoded qubits that are either perfect or have failed. At the second level of 
encoding, however, our code is constructed of singly encoded qubits that may be 
perfect, insufficiently corrupted to result in failure, or failed, yet only the last case is 
considered an encoded error. In some sense, the qubits that have not failed are now 
of lower quality than they were at the previous level. Thus, the fact that encoding 
worked at the previous level does not guarantee that it will work at the current one. 

Given a precise mapping between encoded error rates and unencoded error rates, 
the threshold error probability is bounded below by the unencoded error rate such 
that, for a particular fault-tolerant procedure, the encoded error rate is the same. 
In absence of a precise mapping, the aforementioned probability might be called an 
approximate lower bound on the threshold. Generally, however, I think that the 
goal of researchers who perform such approximate calculations is not to bound the 
threshold so much as to estimate its value for a particular fault-tolerant procedure. 
Consequently, I more often use the appellation "threshold estimate" than "approxi- 
mate lower bound on the threshold" . 

Following Aharonov and Ben-Or [2, 3], we can estimate the threshold analytically 
by counting the number of unencoded gates g involved in the most complex encoded 
gate of a particular procedure. A fault-tolerant procedure does not spread errors, so 
t + 1 errors must occur to generate t + 1 errors on an encoded block. The number of 
ways to choose t + 1 errors on g gates is {^^^ , so, if there are no memory errors or 
initial errors on the data blocks, the encoded failure probability, i.e., the probability 
of t -|- 1 or more errors occurring, is bounded by 



Encoded 




(2.122) 



error rate 



where t is the number of correctable errors and p is the probability of an unen- 
coded error (or the largest such probability if there are several). The corresponding 
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threshold estimate is 

This is an estimate rather than a bound because the data blocks to which the encoded 
gate is applied are not necessarily free of errors initially. An accurate accounting these 
errors is difficult to make since they ultimately depend on the errors, successes, and 
failures of preceding gates. 

The standard numerical approach to estimating the threshold, introduced by 
Christof Zalka [ ], is to program a Monte-Carlo routine that propagates Pauli errors 
through a sequence of gates corresponding to an encoded gate while, with some 
pre-assigned probability, causing unencoded gates randomly to fail and generate 
additional Pauli errors. Over the course of many runs, statistics on encoded gate 
failures are collected and used to approximate the encoded failure rate associated 
with the unencoded error probabilities utilized. By repeating this process for a 
sequence of unencoded error probabilities enough data can be obtained to make a fit 
of the encoded error rate as a function of some small number of parameters in the 
unencoded error model. The threshold is then taken to be the point (or surface) where 
the encoded error probability becomes less than the unencoded error probability. A 
number is obtained for the threshold only when the encoded error probability is a 
function of a single parameter. This is generally accomplished by setting the failure 
probability of all unencoded gates to be equal and assuming that gate failures obey 
the depolarizing error model, that is, that they produce all possible Pauli errors 
with equal probability. Aside from the fact that it is approximate, the primary 
disadvantage of this method of threshold estimation is that it is computationally 
prohibitive to apply to large codes or error models with many degrees of freedom. 

Until recently, most results regarding thresholds have been one sort of estimate 
or another. Extraordinarily, these estimates have currently settled near the 1% 
mark [44, 29]. For a more detailed discussion of the possible pitfalls of such calcu- 
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lations, the reader is referred to the work of Svore and others [55, 54]. Chapters 3 
and 4 focus largely on producing further estimates of the threshold in various cases 
of interest. 

2.7.2 Upper Bounds on the Threshold 

Several rigorous upper bounds on the threshold for fault-tolerant quantum comput- 
ing have been produced by proving that quantum gates suffering from exceedingly 
probable Pauli errors can be simulated by a classical computer. I do not discuss 
upper bounds on the threshold further, but it is interesting to note that such proofs 
have pushed the depolarizing threshold below 50% [12, 42]. 

2.7.3 Lower Bounds on the Threshold 

The art of obtaining rigorous lower bounds on the threshold is experiencing a re- 
naissance [5, 45, 6, 4]; recently lower bounds have reached 10~^. In what follows, I 
briefly review a method of bounding the threshold developed by Aliferis, Gottesman, 
and Preskill. A much more detailed treatment is contained in their very readable 
paper [5]. 

The analytical approach to calculating the threshold presented in Section 2.7.1 
could not be used to establish a lower bound because encoded and unencoded errors 
were incommensurate, effectively precluding an induction step that might prove an 
arbitrary reduction in the error rate were possible. The primary origin of this diffi- 
cultly lay in the dependence of encoded errors on failures outside of the scope of the 
encoded gate, though a secondary problem arose from the complexity available to a 
quantum error model. 

Aliferis et al. do away with the complexity of quantum error models by adopting 
an adversarial local error model at each level of encoding. As with other local 
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error models, failures are assumed to strike individual components randomly and 
independently, but the errors induced are assumed to be the most destructive ones 
possible on the faulty component. This error model is unphysical since the most 
destructive errors at one level of encoding are not necessarily compatible with the 
most destructive errors at the next level of encoding, but the threshold can only 
be reduced by considering such harsh errors, so the associated threshold probability 
can safely be interpreted as a lower bound. Moreover, this choice eliminates the 
enormous burden of identifying the encoded failure that results from an excess of 
component failures. 

Establishing the independence of successive levels of encoding is more compli- 
cated. I begin by defining some terms for a d = 3 error correcting code; in these 
definitions, fc-Ga and k-EC stand for gates and error corrections at the k-th level of 
encoding. 



fc-Rec An encoded gate and the subsequent error corrections in a kth level circuit. 



fc-Ga — k-EC 



k-Gn — k-EC 



fc-exRec An encoded gate and the preceding and subsequent error corrections in a 
fcth level circuit. 



,.p. 



k-EC H A;-Ga Y+\k-EC H A;-Ga H k-EC 



Bad A A;-exRec is bad if it contains two independently bad {k — l)-exRecs. 

Dependent A pair of bad /c-exRecs are dependent if they overlap and the first k- 
exRec is not bad when the overlapping k-EC is ignored. 



k-EC 




A;-&a 


-H 

..I. . 


k-EC 



k-Ga — k-EC 



Good A k-exRec is good if it is not bad. 



From these definitions it follows that a good fc-exRec takes a valid input block, 
that is, a data block having at most one error at the beginning of the /c-Rec, to a 
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valid output block. In order for there to be one error at the beginning of the fc-Rec, 
at least one error must have occurred during the leading error correction, implying, 
since the /c-exRec is good, that no additional errors occur and the output state is 
flawless. By contrast, if there are no errors on the data at the location preceding 
the fc-Rec then an error might happen during the k-Kec, but, since the procedure is 
fault-tolerant, this yields no more than a single error on the output. 

Ultimately, of course, we care whether a circuit gives the correct answer, not how 
good it is. A correct A;-Rec should satisfy 



k-Rec 



ideal 






ideal 




ideal 


/c- decoder 






fc-decoder 




0-Ga 



(2.124) 



For a 1-Rec contained in a good 1-exRec, fault-tolerance implies that 



1-Rec 



ideal 




ideal 




ideal 


1-decoder 




1-decoder 




0-Ga 



(2.125) 



Using Equation (2.125) as a base case. Figure 2.14 shows the inductive argument 
necessary to prove that a k-Rec that is part of a good k-exRec is correctly decoded 
by an ideal decoder. 

From all of this we learn that the analytical approach of Section 2.7.1 was basically 
right, but that, instead of counting the number of gates in an encoded gate ( 
1-Rec), we should have been counting the number of gates in an encoded gate plus 
the leading error correction (a.k.a. a 1-exRec). 

Thus, a number bounding the threshold for quantum computation can be ob- 
tained by, for instance, setting the failure probabilities of all gates to be equal (mul- 
tiple free parameters yields a surface instead of a number), and counting the gates 
involved in a 1-exRec. If g is the number of gates in a 1-exRec and no memory errors 
occur then a loose bound on the threshold pth is 



Pth > 



(2.126) 
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{k + 1)-EC — {k + 1)-Rec — {k + l)-decoder 



{k + 1)-EC — {k + 1)-Rec 
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Figure 2.14: The threshold dance, the inductive step of the proof that a k-Kec which 
is part of a good fc-exRec is correct. The dance is performed with ideal decoders. 
The /c-decoders pictured act on the bottom k levels of encoding. If the fc-exRec is 
good, then the resulting 0-Ga is perfect. 



A tighter bound can be obtained by only counting malignant pairs of faults, that is, 
those pairs faults that might possibly cause an encoded failure. For a particular error 
model, even higher values of the threshold result from factoring in the probability 
that a pair of failures generates a fatal error, but the probability thus obtained is not 
a bound since the encoded error model will not match the unencoded error model. 



2.7.4 Propagating More General Errors 

The codes and procedures considered in this chapter are intended to combat arbitrary 
independent, local errors, but my analysis of them has been almost exclusively in 
terms of stochastic Pauli channels, that is, error models in which only Pauli errors 
occur. 

In Section 2.5.6 we saw that the measurement of check operators can be used 
to project arbitrary errors affecting at most t = [{d — l)/2j qubits into the Pauli 
basis. This observation does not, by itself, absolve us of the need to consider other 
error models, however, since many gates are typically performed between each mea- 
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surement of a check operator. To determine whether a stochastic Pauh channel is 
an suitable substitute for another error model, it is necessary to consider how each 
builds up over the course of a computation. This question has been examined by 
several authors [3, 31, 40]; my own treatment is given below. 

Begin by considering an arbitrary trace-preserving error operator S. The action, 
on a state p, of any such error operator can be written as 

S{p) = J2e^pE^^ where ^E^^E^=I. (2.127) 

i j 

By interspersing errors of this form with perfect quantum gates it is possible to 
model any faulty quantum circuit that does not suffer from leakage. When the 
error operators are local, it makes sense to approximate them by stochastic Pauli 
channels. Given a local error operator satisfying Equation (2.127), I define the 
associated stochastic Pauli channel to have error probabilities 

p, = ^|tr(E^X)f , 

j 

Py = Y^ |tr(E^r)|^ , and (2.128) 

j 

j 

By design, this channel correctly reproduces the probability of measuring that a 
given Pauli error occurred after a single application of the general error operator. 
Its suitability in more varied circumstances is the subject of the remainder of this 
section. 

In practice, many gates are required (and therefore many error operators act) 
between each error correction, so it is important to know how errors accumulate. 
As for the case of stochastic errors, general trace-preserving errors can be separated 
from the associated circuit providing that it is composed exclusively of Clifford gates. 
The separation is accomplished by applying Pauli propagation to each term in the 
Pauli-basis decomposition of the elements, e.g. , of the error operator. If the 
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Clifford circuit is fault tolerant, then error propagation maps single-qubit errors to 
single-qubit errors (on a given encoded block). Each of the resultant error operators 
differs from the actual error by a simple relabeling of the local Pauli basis. Thus, the 
intervening circuit can be disregarded; it is sufficient to consider how the transformed 
local errors accumulate. 

A sequence of s single-qubit trace-preserving errors acting on a state p can be 
written as 



s 



js 32 ji \k=s / \A:=1 



(2.129) 



The probability of measuring, for example, an X error on the resulting state is 



js j2 jl 



1 



tr 



\k=s 



(2.130) 



By contrast, replacing the error operators with their associated stochastic Pauli 
errors, as defined in Equation (2.128), yields 

s 

= E E HE'''X)\' + 0{p^). (2.131) 

k=l jk 

When and psx (and the equivalent probabilities for Y and Z) agree, the associated 
stochastic Pauli channel is a good substitute for the actual error channel. To the 
lowest nontrivial order in p, the condition for equality can be derived as follows. 

Consider each error operator £^ as a function of the total single application error 
probability p. Taylor expanding the elements of S in y/p yields 

E' = I + ^{aoX + f3oY + 70Z + 60I) + 0{p) 

(2.132) 

E^Vo ^ ^(ajX + p,Y + ijZ + 5jl) + 0{p) 

where the freedom in the has been used to assure that E^ contains the only term 
independent of p. 
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Inserting expanded error operators of the form given in Equation (2.132) into 
Equation (2.130) and discarding terms of order greater than p yields 

2 



tr 



5^tr [E^'X) 



k=l 



(2.133) 



Thus, to first order in p, the difference between p^x and p^x is 



Pex Ps} 



p 



=1 

s 



k=l 



-^|tr(EO'=X)| 

k=l 



(2.134) 



k=l 



= P^^f^okQoi. 

k=l l^k 

As suggested by Preskill [10], this expression has a simple interpretation in terms 
of a 2-D walk composed of s steps of sizes |^aofc|- Equation (2.134) is equal to 
the difference between the square of the displacement for such a walk and the ex- 
pectation of the square of the displacement assuming that the walk is random, that 
is, that stepping forward and backward are equiprobable. Thus, the expectation of 
Equation (2.134) vanishes if the sign of a^k is random. Taking a slightly different 
approach, we can treat the entire expression as the displacement of a 2-D random 
walk composed of s(s — 1) steps of sizes \pciQk(^Qi^k\- The expectation is again seen 
to vanish when the sign of a^k is random, but now it becomes clear that the stan- 
dard deviation will scale like s. Since is also proportional to s, this implies that 
the associated stochastic Pauli channel is only a really exacting substitute when the 
number of qubits being considered is large. 

An identical argument holds for Y and Z errors, showing that, for large n, error 
models for which the sign of the coherent error is random are well approximated by 
their associated stochastic Pauli channel. Conveniently, this restriction is preserved 
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under any local relabeling of the Pauli bases and therefore applies equally well to the 
original error operators. Examples satisfying the restriction include all stochastic 
errors, which have no coherent component, and unitary rotation errors where under 
and over rotation are equally likely. Systematic errors, such as amplitude damping 
or a bias towards over rotation, are not well modeled, though, in practice, the local 
relabeling of the Pauli bases imposed by gates will randomize these errors somewhat. 



84 



Chapter 3 

Channel Dependency of the 
Threshold 



The approach to fault tolerance presented in the background material was built 
around the idea that quantum gates fail independently, producing an arbitrary error 
on the involved qubits. Consequently, it was necessary to construct circuits satisfying 
the strictures of fault tolerance for any conceivable Pauli error since a single gate 
failure might entail Pauli errors of any sort on the participating qubits. To simplify 
the analysis, it was subsequently assumed for the analytical lower bound on the 
threshold that all gates failed with the same probability. In numerical estimates of 
the threshold, it is generally additionally assumed that all Pauli errors that might 
result from a single gate failure are equally likely. These are common, reasonable 
assumptions, but they do not reflect the diversity of error models that appear in 
actual physical systems. We might therefore ask the question, "What effect do these 
assumptions have?" 

It is certainly possible to reduce the threshold for quantum computation by dis- 
carding some of the standard assumptions, e.g., independence. Instead, in this chap- 
ter I investigate whether the threshold can be increased by modifying the standard 
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assumptions about the error model. The error model is a a promising candidate for 
modification since it is typically chosen for generality and convenience rather than 
performance. 

An analytical threshold bound for an arbitrary error channel would be impractical 
due, if nothing else, to the complexity of the result. Given a quantum code and fault- 
tolerant procedure, however, the effect of any particular stochastic Pauli channel on 
the threshold can be investigated numerically using a Monte-Carlo routine simply 
by varying the probabilities of different errors in the simulation. This approach 
has been used previously to study the dependence of the threshold on the relative 
probabilities of various gate failures [53]. I chart a somewhat more involved course 
by first tailoring a fault-tolerant procedure to a particular error model and then 
investigating the concomitant threshold. Knill [3(J] has produced impressive results 
using this kind of approach for error models with heralded errors. 



3.1 Threshold Estimation 



The simulation that yields the bulk of the data for this chapter is a Monte-Carlo error- 
propagation routine of the general sort introduced in Section 2.7.1. The program 
used here is specialized to the Steane code and includes functions implementing all 
of the operations necessary for applying Pauli error propagation on encoded Clifford 
gates laid out following Steane's method of fault-tolerant quantum computation. The 
circuits corresponding to these functions (as well as a few additional ones) are given 
in Section 2.6; in the unspecialized procedure, each syndrome is extracted twice. 

The threshold estimation portion of my code prepares 16 logical qubits in the 
state |0) and then enters a loop which randomly applies an encoded gate from the 
generating set {H, S, '"X} of the Clifford group, checks for an encoded failure, pre- 
pares anew any failed logical qubits, and repeats. The encoded failure rate for the 
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Figure 3.1: The encoded X error probability p versus the unencoded gate error 
probabihty p. The unencoded error channel here is depolarizing, and the code and 
fault-tolerant method used are those of Steane. For reference, a diagonal line demar- 
cating the break-even point for encoding is drawn in red. The intersection of this 
line with the blue curve fitting the data gives a depolarizing threshold estimate of 
PDth = 0.001. Error bars fit within the dots. 

gate of interest is estimated by counting the number of times the gate is applied 
between each time it fails. Statistics are taken for this counting data, and the loop 
exits when the variance in the average reaches the target value. The output of my 
simulation for the case of Steane 's method and a depolarizing error channel is plotted 
in Figure 3.1. As in all subsequent plots, only data for the encoded '"X gate is shown 
since its encoded error rate is roughly a factor of two greater than either of the other 
two gates. 



3.2 Symmetric Two-qubit Error Channel 



In place of the depolarizing channel, I consider a symmetric two-qubit-gate error 
channel, that is, an error model such that errors are generated exclusively through 
the failure of two-qubit gates (which, in this chapter, means ^X gates) where the 
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Figure 3.2: The accurate reporting property of '"X gates which generate exclusively 
symmetric errors. The identities in a) and b) show that a.n X ® X or Z ^ Z error 
after a '"X gate is equivalent to a single error prior to it, implying that ^'X gates used 
in error extraction never misreport the state of the data. 



generated error can be decomposed into the Pauli operators X ® X, Y Y, and 
Z^Z. 

This error model is not chosen for its physical plausibility. Indeed, I consider it 
unlikely that the ^X gate should be implemented in such a fashion that symmetric 
two-qubit errors dominate. Rather, my choice of error model reflects an attempt to 
pick a non-trivial stochastic Pauli channel for which real gains might plausibly be 
expected in the threshold. 

The symmetric '"X error channel is a promising candidate for improving the 
threshold because the correlated errors generated by a gate failure are exactly those 
that would be produced by the propagation of single-qubit errors initially present on 
the input qubits. Since '"X converts single-qubit errors to symmetric two-qubit errors, 
these errors form a kind of default two-qubit-gate error set. Moreover, symmetric 
errors interfere minimally with '"X gates used for error extraction since, even in the 
event of a failure, the indicated error is actually on the data. Figure 3.2 illustrates 
this useful property. 



3.3 Tailored Fault-tolerant Procedure 



Starting from Steane's method, this section lays out a modified fault-tolerant proce- 
dure tailored to combat symmetric two-qubit '"X errors. 
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Given the fact, discussed in the previous section, that X gates which fail sym- 
metrically always accurately report data errors, it makes sense to try to reduce the 
number of times that ancillae are coupled to the data for the purpose of extracting 
error information. In order to do so, however, the quality of the logical basis states 
used in Steane-style syndrome extraction must be improved; repeated syndrome ex- 
traction guards against massively faulty ancillae as much as against faulty coupling 
gates. Ancillae free (to first order) of correlated errors might be produced using the 
method discussed in Section 2.6.5, but instead I describe how especially low-error 
ancillae can be produced by taking advantage of our knowledge of the error model. 

I employ two basic tricks for constructing high quality ancillae, each relying on 
the fact that only a subset of Pauli errors can result from most circuits. The first 
is to design circuits such that some fraction of the possible Pauli errors are, in fact, 
stabilizers, a trick epitomized by the circuit for constructing Bell states shown in 
Figure 3.3a. Since X X, Y ® Y, and Z ® Z aW. stabilize l/^oo), this circuit is 
completely unaffected by a symmetric error on the ^X. The same trick also plays 
a role in the cat-state construction circuits of Figure 3.3b and Figure 3.3c where 
many symmetric '"X errors are equivalent to some combination of the stabilizers 
{ZiZj, X'^^}, where m is the size of the cat state and 1 < i, j < m and i ^ j. The 
design of these circuits is such that all other symmetric errors resulting from a single 
gate failure are detected by one or both of the measurements, so, conditional on 
positive measurement outcomes, the prepared cat states are, to first order, flawless. 
This is sufficient to insure fault tolerance for the Steane code since it corrects only 
single errors. 

The construction circuit that we need for Steane-style syndrome extraction, how- 
ever, is the one for |+) in part d) of Figure 3.3. This circuit takes advantage of both 
the techniques and the states generated in the other examples. That it generates 
the desired state, absent errors and conditional on positive measurement results, can 
be seen most easily by considering the evolution of the stabilizer. Subsequent to 
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preparing the two input cat states, a stabilizer generator for the state is 

{Z1Z2, Z2Z3, X1X2X3, Z4Z5, Z^Zq, ZqZ-i, X^X^XqXj} (3.1) 

To account for the measurement of +1 for X2X5X7, it is sufficient to take products 
of stabihzer generators until only a single generator anti-commutes with X2X5X7 
and substitute X2X5X7 for it, yielding, for example, 

{Z1Z3, Z1Z2Z4Z5, Z1Z2Z5ZQ, ZiZ2ZqZi, X1X2X3, X^X^XqXt, X2X5X7}. 

(3.2) 

Doing the same for XiXqXj gives 

{Z2Z,Z,Z„ Z1Z2Z5ZQ, Z2Z2,ZqZt, X1X2X3, X4X5X6X7, X2X5X7, XiXqX-i} 

(3.3) 

which is equivalent, under products of the stabilizer generators, to 
{ZiZ^Z^Zj, Z2Z3ZqZj, Z^Z^ZqZj, 

(3.4) 

X1X3X5X7, X2X3X6X7, X4X5X6X7, X1X2X3} 

the standard generator for |+) for the Steane code. The remaining two measurements 
are needed only for error detection; being measurements of stabilizers of |+) they do 
not, ideally, change the state. 

Verifying that the circuit in Figure 3.3d is robust against any single symmetric 
'"X failure is more complicated than for the Bell or cat states since the state |+) is 
partially formed by the measurements. Given perfect input ancillae, which, to first 
order, we have, it is clear that any single X ^ X or Y ®Y error will be detected by 
one of the measurements on the lower qubits of the Bell states. A Z ®Z error on any 
of the first eight '"X gates, however, will flip the result of one or both of the initial 
two stabilizer measurements, causing us to retain the wrong state. The state that we 
actually produce can be written in terms of an undamaged copy of |+), the residual 
error on |+) from the gate failure, and the errors ZiZ^ and Z2Z3 which account 
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Figure 3.3: Circuits for constructing a) \Pab), b) the 3-qubit cat state, c) the 4-qubit 
cat state, and d) logical |+) for the Steane code, such that any single symmetric 
error on a '"X is either harmless or detected. As before, the overtop tilde is used to 
denote logical qubits encoded in the classical repetition code. 



for our mistaken selection. The errors Z1Z3 and Z2Z3 are an appropriate choice for 
representing the result of retaining the wrong state, since they both commute with 
all of the generators in Equation (3.1) or, in other words, with the stabilizers of the 
input state, but each anti-commutes with one of the two measurements. Thus, 



(+ 


ZiZ^XiXqXjZiZ^ 


+) = 


-(+|XiX6X7|+) = -l 


(+ 




+) = 


(+1X2X5X71+) = ! 


(+ 


Z2Z3X1XQXJZ2Z3 


+) = 


(+IX1X6X7I+) = 1 


(+ 




+) = 


-(+1X2X5X71+) = -! 



(3.5) 



A Z ® Z error on the first X in Figure 3.3d, for example, would yield the state 

Z2 Z2Z3 1+) = Z3I+) (3.6) 
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Figure 3.4: Logical syndrome extraction circuits for a) X and b) Z errors. Extraction 
follows Steane's method, but no repetition is necessary due to the quality of the 
ancillae employed. 



half-way through the circuit, but a single Z error on the third qubit would be detected 
by the error checking portion of the circuit. In a similar way, it can be shown that 
a, Z ® Z error on any single '"X gate in the first half of the circuit is detected. A 
Z ® Z error on a single ^X gate in the second half of the circuit flips some certain 
measurement outcome and is thus easily detectable. 

The upshot of the preceding paragraph is that |+) states constructed using the 
circuit in Figure 3.3d are not only free of correlated errors to first order, which was 
our primary goal, but are, in fact, free of any errors whatsoever. The availability of a 
transversal encoded Hadamard gate means that the same is true for |0). Combining 
such ancillae with the accurate copying property ensured by our error channel, it 
makes sense to reduce the number of syndrome extractions in Steane's fault-tolerant 
procedure to one per kind of error. In this modified procedure, the logical circuits 
for performing X and Z syndrome extraction are simply those given in Figure 3.4. 



3.4 Bounding the Threshold for Symmetric Errors 



The method of analytically bounding the encoded error rates introduced in Sec- 
tion 2.7 applies equally well to the error model and fault-tolerant procedure just 
described. The only modification necessary is to restrict the space of possible errors 
to the symmetric '"X errors. 
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The question of a bound on the threshold is somewhat more subtle, however, 
since the error model describing the encoded gates will not be that of symmetric '"X 
errors. In lieu of the more standard restriction, that the encoded error probability be 
less than the unencoded error probability, I require that the encoded error probability 
for this procedure and error model be no greater than the threshold for more general 
kinds or errors. Thus, psth, the threshold for symmetric errors, is bounded below by 
the symmetric error probability ps such that 



where G is the number of malignant pairs in the my most heinous exRec and pAth 
is the threshold for adversarial errors in a more general fault-tolerant procedure. In 
what follows, I actually use a slight modification of Aliferis's counting method suited 
to the case that X ^ X, Y and Z Z are all equally likely; I replace Gpg with 
i^Pg/9 where K counts different kinds of malignant errors at a given pair of locations 
as different malignant pairs. 

The problem of counting malignant pairs is greatly simplified by the fact that 
my ancilla construction routines all output perfect states when success is indicated 
and fewer than two gate failures have occurred. A single failure occurring during 
the construction of a Bell, cat, or |+) state is of no consequence whatsoever. Thus, 
any malignant pair of errors involving ancilla construction must both appear in the 
construction procedure. 

There are no such pairs in the Bell state circuit. For the n-qubit cat state circuit, 
however, there are 



pairs that result in some kind of error on the state; I assume pessimistically that 
all of these are fatal. Given that there are no errors on the input states, the circuit 
for constructing 1+) includes 68 or fewer malignant pairs, so, including cat state 



GPs = PAth 



(3.7) 




(3.8) 
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Figure 3.5: The exRec for '"X in my fault-tolerant procedure. 



construction, the total number of malignant pairs in the construction circuit for |+) 
is 108. The probability of occurrence for these malignant pairs is actually bigger 
than 108pg/9 since we are discarding events in which detectable errors are produced, 
but for ps ^ 0.01 the difference here is slight. 

For this fault-tolerant procedure, the exRec with the largest number of malignant 

pairs is the one for the encoded gate. Ignoring ancilla errors, this exRec, shown 

in Figure 3.5, has a total of 

/7 X 9\ , , 

9 X = 17577 (3.9) 



2 

possible distinct error pairs. Counting the number of malignant ones with any kind of 
accuracy is not an easy mental exercise, but a short Python program hacked together 
for the purpose yields up the answer 3927. Thus, including ancilla construction, the 
total number of malignant pairs in the exRec for is 4791, and the corresponding 
bound on the threshold is given by 

532.3pith > PAth- (3.10) 

The information content of the bound in Equation (3.10) depends strongly on the 
value of pAth- For pAth = 10^^, it tells us that the threshold for symmetric '"X errors 
is more than an order of magnitude larger than the threshold for general adversarial 
errors. For pAth > 1/532.3 ~ 0.19%, it tells us nothing at all. An approximate 
number for this bound can be obtained by replacing pAth with pDth = 0.001, the 
numerical estimate of the depolarizing threshold obtained in Section 3.1. Doing so 
yields 

Psth > VpDth/523.3 > 0.0014 = 1.4pDth, (3.11) 
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a rather small improvement. Even this estimate of the bound is overly optimistic, 
however, analytical threshold bounds in the literature range as high as 1.94 x 10~^ [1] 
and estimates placing the threshold above 0.2% are already common [44, 29]. 

These analytical results do not bode well for my program of tailoring fault-tolerant 
procedures, but the lower bound given here is not tight, due both to my approximate 
treatment of ancilla errors and to the nature of the exRec method. For further 
evidence, I turn in the following section to numerical techniques. 



3.5 Estimating the Threshold for Symmetric Er- 
rors Numerically 

As for the analytical estimate, a numerical estimate of the threshold for symmetric 
errors requires that the encoded error rate for the tailored procedure be referenced 
to a more general threshold result, the implied structure being that of a tailored 
fault-tolerant procedure implemented at the first level of encoding combined with a 
more general procedure at higher levels. 

I determined the error rates for the initial level of encoding numerically using the 
simulation described in Section 3.1. To adapt my code for depolarizing threshold 
estimation to the case of threshold estimation for symmetric '"X errors using a tai- 
lored fault-tolerant procedure, I simply changed the input error model and defined 
new functions for ancilla construction and error correction based on the circuits of 
Section 3.3. 

Figure 3.6 shows the encoded failure probability of the '"X gate as a function 
of ps, the probability of a symmetric '"X error. The horizontal line on the graph 
marks the value of the depolarizing threshold calculated in Section 3.1. In order 
for subsequent levels of encoding to succeed, the encoded error probability after 
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Figure 3.6: The encoded error probability p versus the unencoded error 
probabihty ps- The unencoded error channel here is that of symmetric '"X errors, 
and the fault-tolerant procedure used is that described in Section 3.3. For reference, 
a line marking my earlier estimate of the depolarizing threshold is drawn in red. The 
intersection of this line with the blue curve fitting the data gives an estimate of the 
threshold for symmetric errors of psth ~ .0017. Error bars fit within the dots. 

the initial level of encoding must be less than the threshold for the new encoded 
error model. Taking the new error model to be depolarizing^, this restriction yields 
pgth ^ 0.0017 as an estimate of the threshold for symmetric errors on my modified 
fault-tolerant procedure. This estimate agrees reasonably well with the conclusions 
of the previous section. Only a small improvement, less than a factor of two, is 
achieved by my tailored fault-tolerant procedure. 

Increasing the threshold by a factor of two is not without merit, but my investiga- 
tions up till now have indicated this improvement only for the case of an error model 
with exclusively symmetric ^X errors. That a physical implementation of a quantum 
computer should suffer primarily from symmetric '"X errors is perhaps unlikely, but 
that such a device should suffer from them exclusively is unthinkable. To be useful 

^Even taking the encoded error model to also be symmetric yields only a total factor 
of 2.6 increase in the threshold. 
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Fiffure 3.7: Plots of the encoded error rate p for my tailored procedure (in 
blue) and the standard Steane-style procedure (in green) for various values of the 
symmetric error rate ps and the portion of the depolarizing channel added A. The red 
horizontal plane marks the estimate of the depolarizing threshold obtained earlier, 
so gates with error rates above that are unlikely to benefit from further encoding. 



at all, my tailored procedure must be robust against small perturbations in the error 
model. 

To investigate the stability of my procedure, I added a variable depolarizing 
component to the symmetric '"X error model in my code. The amount of depolarizing 
error was quantified by the parameter A where 0<A<1. A = gives the 
symmetric error model and A = 1 gives the standard depolarizing model. The 
encoded error rates for values of A and ps such that < A < 0.08 and 0.0085 < 
Ps ^ 0.00165 are plotted in Figure 3.7 for both my tailored and the standard fault- 
tolerant procedures. From Figure 3.7 it can be seen that, for values of ps greater than 
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Figure 3.8: Fits of the data in Figure 3.7 displayed as a density plot of the difference 
between the encoded '"X error rate for my tailored procedure and that of Steane. 
Green indicates that my procedure has a smaller error rate, while red favors Steane. 
The magenta line marks the fit of the threshold for my procedure, and the yellow 
line likewise marks the fit for Steane's. 



0.00085, the standard fault-tolerant implementation quickly overtakes my tailored 
procedure as the amount of depolarizing error is increased. An advantage is obtained 
only when the strength of the depolarizing errors is less than roughly 4% of what it 
would be for a depolarizing channel given the size of the symmetric '"X errors. From 
Figure 3.8, which displays a fit of the data using a density plot, we see that, compared 
to the standard procedure, the largest increase in the threshold over this range was 
less than 35%, showing that much of the observed improvement for symmetric '"X 
errors was not due to changing the fault-tolerant procedure at all. 




Chapter 3. Channel Dependency of the Threshold 



98 



3.6 Analysis 

The results of my effort to tailor Steane's fault-tolerant method to a particular error 
model were largely negative. In ideal circumstances a small increase in the threshold 
can be achieved, but perturbations of the error model on the order of only 5% of the 
symmetric '"X error probability are sufficient not only to wipe out the improvement 
but to reduce the effectiveness of the procedure below that of the standard approach. 

In retrospect, the narrowness of the window for improvement is unsurprising since 
the construction routine for |+) is not fault tolerant against general errors. In order 
for the procedure to function as designed, non-symmetric first-order errors must be 
negligible, which is to say that they should occur with probability of at most or 
so. For very small values of ps this is a physically unreasonable asymmetry to expect 
in the probabilities of various errors, and for much larger values of ps there is, as 
we have seen, little advantage to be had from my procedure. 

More interesting is the reason for the rather small increase of the threshold with 
the introduction of improved ancillae. Naively, it would seem that utilizing ancillae 
that are, to first order, perfect would be advantageous, but the analysis of this chapter 
appears to suggest otherwise. 

To settle this point I performed one final simulation to determine the encoded 
error rate for '"X using Steane's fault-tolerant procedure and the full depolarizing 
error model but with the ancilla error probabilities all set to zero. The output of 
this simulation is displayed in Figure 3.9 along with a line marking the threshold 
for subsequent encoding. The estimate for the depolarizing threshold associated 
with this case, where ancillae are absolutely perfect, is 0.0012, only 20% bigger than 
the depolarizing threshold I obtained for ancillae constructed using the prepare and 
discard method. 

This fact suggests an interesting idea. The best threshold estimates employ 
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0.00025 0.0005 0.00075 0.001 0.00125 0.0015 0.00175 
P 



Figure 3.9: The encoded p error probability versus the unencoded X error 
probabihty p. The error channel here is depolarizing, but the ancillae are assumed 
to be flawless. The code and fault-tolerant method used are those of Steane. For 
reference, a line marking my earlier estimate of the depolarizing threshold is drawn in 
red. The intersection of this line with the blue curve fitting the data gives an estimate 
of the threshold for depolarizing errors and perfect ancillae of psth ~ 0.0012. Error 
bars fit within the dots. 

construction routines that discard large numbers of qubits during ancilla construc- 
tion [29, 44], so the verified ancillae are quite good. Consequently, ancillary errors 
are likely to play a less dramatic part in threshold estimates than one might expect. 
Taking this idea to the extreme, one might contemplate new methods of estimating 
the threshold in which ancillae play only a minor role. Indeed, the topic of the next 
chapter is exactly that. 
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Chapter 4 

Thresholds for Homogeneous 
Ancillae 

Based on the observations of Chapter 3, it is uncertain how much impact ancilla 
construction has on the threshold for fault-tolerant quantum computation, but it is 
clear that improved ancillae have very little effect on the method of procedure spe- 
cific threshold estimation employed there. This is an intriguing finding (though not 
entirely unheard of [44]) given the amount of effort that is expended on designing 
construction procedures for various kinds of ancillary states, since it implies that 
threshold estimation might be accomplished without any reference to ancilla con- 
struction whatsoever. Ancilla construction is a messy affair to treat analytically, so 
the option to basically ignore ancillae also provides hope that approximate values of 
the threshold for selected procedures might be obtained analytically. In this chapter 
I describe such an analytical method and use it to investigate the relative merits of 
a selection of error models and fault-tolerant procedures. 
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4.1 Assumptions 

Some large part of the variance in fault-tolerant threshold calculations is due to the 
variety of assumptions employed by various authors. In an effort both to combat 
confusion and to facilitate comparisons, my assumptions for this chapter are listed 
below in roughly the order of decreasing novelty. 

1. All ancillary qubits have independent, identical error distributions. 

2. There are no memory errors. 

3. '"X is the only two-qubit gate. 

4. Any pair of qubits can interact via a two-qubit gate. 

5. Error operators are trace preserving and lack systematic coherent terms. 

6. Gate failures are uncorrelated. 

7. Classical computation is freely available. 

Assumptions 1, 4, 5, 6, and 7 are necessary for my analysis; the others are convenient 
but optional. Assumption 2 obviates the need to consider questions of parallelism, 
gate timing, and the speed of classical computation, while Assumption 3 reduces the 
number of cases that must be considered. 

4.2 Thresholds for Homogeneous Methods 

Two important observations from Chapter 2 provide the foundation for my method 
of threshold calculation. The first is that fault-tolerant procedures for CSS codes 
are, to a large degree, transversal. The second is that, for the kind of CSS codes 
typically employed, these transversal operations can be implemented by applying 
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the same gate to every qubit in a block. The sum of these observations is that most 
operations performed in a fault-tolerant procedure consist of doing the same thing 
to each of the qubits in a block. If one could arrange for all operations to have 
this property, which I refer to henceforth as homogeneity, analyzing the behavior of 
fault-tolerant circuits would be greatly simplified. 

Three components of the typical fault-tolerant method stand in the way of full 
homogeneity: ancilla production, syndrome extraction, and recovery. Starting from 
the perspective of threshold estimation, this section addresses each of these aspects, 
partly by keeping in mind that the eventual goal is to model errors, not computation. 
Ultimately, the method derived is meaningful both as a form of threshold estimation 
and as a threshold bound for idealized resources. 

4.2.1 Ancillae 

The ancillae used in CSS-code fault-tolerant procedures are typically prepared in 
highly entangled states, i.e. in logical basis states. By definition, entangled states 
cannot be constructed without the interaction of the constituent parts, so there is 
no a priori reason to think that the qubits composing an ancilla will have either 
independent or identical error distributions. For fault-tolerant procedures, however, 
the production of an entangled ancilla is usually followed by a homogeneous verifica- 
tion circuit, so one might expect that most of the residual error probability (of the 
kind tested for) arises during this verification step. With this is mind, I approximate 
ancillae as having uniform error distributions. 

It is important to realize that this assumption is far less innocuous than it sounds. 
Implicitly, I am assuming that ancillae of the desired size can be constructed for use in 
a verification circuit, but in subsequent sections I take the limit n — > oo. Thresholds 
given in this limit are only practically achievable if an efficient procedure exists to 
prepare logical ancillae. To be efficiently scalable, however, a construction routine 
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must have nonvanishing probability of generating an ancilla that has good fidehty 
with the desired state. Fault-tolerant schemes using concatenated codes provide a 
method of achieving this for arbitrarily large ancillae, but as a side effect of universal 
quantum computation. At present, there is no known method of preparing a logical 
qubit encoded using a CSS code of arbitrary size that does not depend on the ability 
to perform universal quantum computation. Thus, absent an explicit recipe for 
ancilla preparation, the algorithm presented in this chapter does not constitute a 
constructive procedure for achieving any threshold. 



4.2.2 Error Location 



Current techniques for locating errors require performing a complicated and dis- 
tinctly non-homogeneous function on the output of ancilla measurement. But while 
this classical processing requires knowledge of all the measurements, its effect, as- 
suming that no more than the correctable number of errors has occurred, can be 
described in terms of the individual qubits. So long as the total number of errors 
present on a measured ancilla is less than half the minimum distance^, the effect 
of the classical processing is to determine a subset of the measured bits that can 
be flipped to yield an undamaged codeword. For the purposes of error correction, 
knowing this string is equivalent to knowing the location of all the errors. While 
the second kind of information is not directly available to a quantum computer, it is 
quite accessible to a theorist treating errors probabilistically. I can therefore model 
the effect of classical processing in two steps. First, I determine whether too many 
errors have occurred on a block to permit proper decoding, and, if this is not the 
case, I treat the location of bit flips on the measured qubits as revealed. 

I have reduced the process of error location to a non-homogeneous failure check 

^Some higher weight errors will also be correctable, but I only lower my threshold by 
ignoring them. 
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and an arguably homogeneous revelation step. For a Monte-Carlo simulation, the 
failure check would consist of polling all of the other qubits and counting up the 
number of errors that have occurred to see whether they exceeded half the minimum 
distance of the code. If instead we performed a probability flow analysis, the expected 
probability of passing the check would be simply 



where n is the number of qubits in the block, t is the maximum number of errors 
that can be corrected with certainty by the code, and pc is the probability that a 
particular qubit has an X error at some location (step) C, here chosen to be just 
after the time of measurement. 

Equation (4.1) suggests a way of recovering full homogeneity. Letting r = t/n, 
in the limit of large n, Equation (4.1) becomes 



which is again homogeneous from the perspective of a simulation. As an added 
benefit, it is no longer necessary to concatenate many layers of coding to achieve a 
rigorous threshold; instead a vanishing error probability is achieved as the limit of 
a very large code. This alternative to concatenation is known as large block coding 
or, simply, block coding. 

The preceding paragraphs demonstrate that, for homogeneous (independent, iden- 
tically distributed) errors, whether or not an encoded state on a large number of 
qubits fails is determined by the error probability of an individual qubit. For this 
result to be useful, an infinite family of CSS codes with nonvanishing fractional min- 
imum distance must exist. Fortunately, it has been shown [51, 14] that CSS codes 
exist such that r ^ 5.5% for asymptotic values of n. It is not known whether a 
similar claim can be made for CSS codes in which the encoded phase gate can be 
implemented transversally, but this convenience is not necessary for my construction. 




(4.1) 




(4.2) 
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The analysis of this section assumes minimum distance error correction, but an 
identical result applies to error correction up to the channel capacity. Gottesman 
and Preskill [ ] have shown that families of general CSS codes exist that are asymp- 
totically capable of correcting errors up to r ~ 11%. Hamada [ ] has shown (my 
own version of this proof can be found in Appendix A) that this result applies to 
CSS codes with X-Z exchange symmetry as well. 

The appropriate choice for r depends on the purpose of the calculation. When 
the goal is to estimate the threshold that would be obtained by running a Monte- 
Carlo simulation of a minimum distance decoder, r should be chosen to be 5.5%, 
or, perhaps better still, the limit of the correctable error fraction as the number of 
concatenations of the code in question goes to infinity. To obtain the largest bound 
on the threshold for homogeneous ancillae or for comparison to threshold estimates 
that use the channel capacity, it is best to choose r = 11%. In other cases it may 
be desirable to choose a value of r specific to a family of quantum error correcting 
codes with special properties, such of syndrome decoding or the possession 

of low weight stabilizer operators. 



4.2.3 Recovery 

Having diagnosed the location of our errors, the obvious way of dealing with them is 
to apply to each qubit the gate which reverses its error. Such a recovery operation 
is inherently inhomogeneous since not all qubits will be in error, and thus not all 
qubits will have recovery gates applied to them. There are a number of ways to deal 
with this problem, but I take the approach, described in Section 2.6.1, of dispensing 
with recovery altogether. As explained there, the effect of Pauli gates on Clifford 
circuits can be efficiently dealt with through post-processing. 
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|0) 




^ 

Figure 4.1: A homogeneous sequence of operations on the Steane code and a single 
strand thereof. The inset on the right shows a single strand which has been extracted 
from the coding blocks. Gates between pairs of qubits bind then into the same strand. 
Modulo a label indicating the ancilla's starting state, the strand is identical to the 
encoded circuit. 



4.3 Error Counting 



In the previous section we saw how to modify fault-tolerant procedures based on CSS 
codes so that they are fully homogeneous. The advantage of doing this is that the 
error probabilities of qubits within an encoded block then become independent and 
identical. Blocks can therefore be separated into strands (see Figure 4.1), where each 
strand is made up of a single qubit of a block and all qubits that directly or indirectly 
couple to it. Since each strand is functionally identical, it suffices to determine the 
error spectrum for one of them; the probability of an encoded failure at any point can 
be predicted from the error probabilities of an individual strand of the transversal 
procedure. As the number of encoding qubits becomes large, the fraction of qubits 
with a particular error approaches the expectation for that error. In the limit that 
n —>■ oo, we can say for certain whether our procedure fails on any given step since, in 
that limit, the probability of an encoded failure becomes a step function. Thus, the 
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threshold is completely determined by the probability of an encoded failure, and the 
probability of an encoded failure is completely determined by the error probability of 
a single strand of the blocks. Therefore, in order to calculate the threshold I need only 
determine the error probability on a single strand at every point in the fault-tolerant 
circuit. This can be accomplished through a combination of error propagation and 
exhaustive bookkeeping which I describe in the following sections. 



4.3.1 Error Bookkeeping 



Given a gate, say the Hadamard, and a set of probabilities describing the likehhood 
of various Pauli errors, say px, Py, and pz for the errors X, Y, and Z, the post gate 
state can be written as a probabilistically selected pure state, such that 



XH\^) with probability p^ 
Fi^l^) with probability Py 
ZH\^) with probability p2 
otherwise. 



(4.3) 



The effect of applying further Chfford gates is to change, via error propagation, 
which Pauli error corresponds to each probability, and then to add additional layers 
of probabilistic errors. If, for example, we were to apply another Hadamard gate our 
state would become 



X|^) with probability Pz + Px + PyPz + P: 
with probability 2py + 2pxPz 
with probability Px +Pz + PyPx + P, 

\^) otherwise. 



(4.4) 



By repeated application of this process it is possible to determine the probability of 
various kinds of errors at any point in a circuit composed of Clifford gates. Armed 
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with this knowledge we can determine^ the hkehhood of an encoded failure or, in the 
infinite limit, whether an encoded failure will happen or not. 

4.4 Practicalities 

While the previous section presented the basic algorithm for determining whether 
an encoded failure occurs, this section deals with details of the error model and the 
implementation that must be considered in any actual application of the method. 

4.4.1 Applicable Error Models 

Like much of the work on fault-tolerance, the analysis throughout this chapter em- 
ploys stochastic Pauli errors. Its applicability, however, is not limited to that case. 
We found in Section 2.7.4 that, on average, coherent errors with random phases 
add like stochastic errors. The variance in that average turned out to be large, but 
the variance is suppressed as the number of samples increases, and, in this chap- 
ter, the number of samples is the number of qubits, which is taken to be infinite. 
Consequently, the method described here applies to any unbiased error model. 

4.4.2 Implementation 

In the examples that follow, single-strand error rates were determined for three fault- 
tolerant procedures. For each procedure, error rates were calculated for a encoded 
gate set, H, '"X, P, and T gates, as well as for an idle step that accounted for the 
possibility of changing the order of X and Z error correction. No checks were made 
on the encoded T gate following its first error correction, since the remainder of 

^For non-transversal circuits the number of terms in our bookkeeping rapidly becomes 
unmanageable as the size of the code increases. 
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the gate consists of applying the encoded P gate. The encoded P gate was assumed 
pessimistically to be implemented via a teleportation process akin to that used for T. 
For each encoded gate, the maximum was taken over the strand error probabilities 
at all measurement steps since in the limit that n — oo encoded failures are caused 
exclusively by the largest relevant error probability at a measurement location. 

The error probabilities for unencoded gates, measurements, and ancillae were left 
as free parameters, pr, Pah, Pm, Pat, and p^^ are used to denote one-qubit, two-qubit, 
measurement, A-type-ancilla, and 5-type-ancilla error probabilities where F ranges 
over the single-qubit Pauli errors and AS ranges over the two-qubit Pauli errors. 
Note that ancillae are labeled irrespective of what they encode. A-tjpe ancillae are 
used in locations where Z errors are more disruptive than X errors, and contrariwise 
for i?-type ancillae. In the absence of better information, I assume that A-tjpe 
ancillae are tested using a homogeneous coupling, with discard on failure, for first 
X and then Z errors; the opposite order is used for i?-type ancillae. In this case I 
approximate the ancilla error distributions as 

Pax = Pxz + Pxi + Pix + Pxx 
Pay — PiY "I" PxY 
Paz — Piz ~\~ Pxz 

(4.5) 

Pbx — Pxi ~\~ Pxz 
Pby — Pyi ~\~ Pyz 
Pbz = Piz + Pxz + Pzi + Pzz, 

which is (to first order) what one would expect if the only errors on a verified ancilla 
were due to undetectable errors on the ^X gates used to check it. 

It should be emphasized that the ancilla error probabilities given by Equation (4.5) 
are not the only possible choice. They were chosen as a good approximation to the 
residual error following a verification procedure that discards the state whenever a 
problem is indicated. Depending on the purpose of the calculation, it will sometimes 
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Error 

iViUUcl 


Nonzero Error Probabilities 




Pr = 45 Pas = Pm = f' 
Pax ~~ Pbz ~~ 4 7 Pay ~~ Paz ~~ Pbx ~~ Pby ~~ g 


#2 


Pt = if, Pas = Pm = 4^P, 
Pax — Pbz — '^j Pay — Paz — Pbx — Pby — 


#3 


Pas — 1^1 Pax — Pbz — ^f; 
Pay — Paz — Pbx — Pby — ^ 


#4 


Pix ~ Pxi ~ Piz ~ Pzi ~ 4; 
Pax ~ Pbz ~ Paz ~ Pbx ~ 4 



Table 4.1: Four reduced error models considered in the text and in Table 4.2. 
Pr, Pah, Pm, Pat, and Pbv represent various one-qubit, two-qubit, measurement, 
A-type-ancilla, and 5-type-ancilla error probabilities where F G {X, F, Z} and 
AS G {/, X, F, Z}®^/{/J}. Unspecified probabilities are zero. 



be more appropriate to assign, for example, higher error probabilities associated 
with less resource intensive verification or different probabilities for different kinds 
of ancillae. 



The Mathematica program that I use to calculate encoded error rates retains 
terms up to second order in the base error probabilities, but the results given in the 
following sections include only first-order terms. Second-order terms were found to 
be negligible for any plausible choice of error model. To understand why, consider 
a simplified error model in which gates can fail in only a single way. Let p be the 
probability of an individual gate failing, and let gi be the number of gates on which 
a single failure results in an error at location C Further, let g2 be the number of 
gates that might participate in some pair of failures to yield an error at location C. 
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The expected error at location C is then bounded by 

+ (^^^y^(i-py^-^ + oip') (4.6) 

=9iP - 9ii92 - l)p' + ji92- ly + Oip'). 

The inequahty arises from the fact that not all pairs of failures will necessarily 
produce an error at C Even ignoring that, however, the second-order terms will be 
negative unless g2 > 2gi] negative terms may safely be neglected since their omission 
only lowers the threshold. Among the examples of the following section, the double- 
coupling Steane procedure, when apphed to error model #1, has relatively large 
second-order terms. Yet the worst location in that procedure corresponds, roughly, 
to a single-error situation where gi = 13, (?2 = 27, and p < .018, for which the ratio 
of second to first-order terms is less than .02. Again, this does not even take into 
consideration the fact that many second-order errors will be harmless. 



4.5 Special Cases 

Having described the operation of my algorithm for calculating thresholds, 1 now 
apply it to three cases of interest. Two of these are variants on the fault-tolerant 
method of Steane, while the third case is a Knill-style fault-tolerant telecorrection 
procedure. The mechanics of syndrome extraction for both Steane- and Knill-style 
procedures was described in Section 2.6.4. 

The unrefined output of this endeavor is the set of maximum strand error proba- 
bilities listed in Table 4.1. This table specifies a kind of high-dimensional threshold 
surface in the space of generic stochastic error models. An error channel is below 
the threshold for a particular procedure whenever the maximal strand error proba- 
bilities for that procedure are lower than the fraction of errors that are correctable 
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Thresholds for Homogeneous Ancillae (r) 


— -~----_____Error Model 
Procedure ~ — — 


#1 


#2 


#3 


#4 


Single-coupling Steane 


0.15 


0.06 


0.24 


0.29 


Double-coupling Steane 


0.16 


0.10 


0.18 


0.29 


Knill 


0.35 


0.15 


0.50 


0.67 



Table 4.2: Thresholds for ancillae with homogeneous errors given in units of r, the 
correctable error fraction, for the three procedures and four error models I consider 
as examples. These thresholds were obtained by substituting the parameters given 
in Table 4.1 into Table 4.1 and requiring that single-strand error probabilities not 
exceed r. 

asymptotically. 

For the purpose of illustration, however, it is more useful to consider less com- 
plicated error models. Table 4.1 defines four reduced error models in terms of the 
generic stochastic error model. Since these reduced error models have only a single 
free parameter, their threshold surfaces are simply numbers. Table 4.2 lists thresh- 
olds for three procedures and four reduced error models in terms of r, the asymptotic 
correctable error fraction. 

The following sections provide supplemental information specific to each proce- 
dure, including qualitative reviews of the procedures, circuit diagrams for encoded 
gates, and commentary on the thresholds given in Table 4.2. 

4.5.1 Steane's Method 

Typical instantiations of Steane's method employ multiple extractions to guard 
against errors made during the coupling and measurement process. Often [53, 44, 57] 
the number of extractions performed is conditional on their output. I deviate from 
this rule by demanding a fixed number of couplings. This a sensible choice for my 
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Figure 4.2: Encoded circuits for the single-coupling Steane procedure. Handling of 
the data is minimized at the cost of syndrome verification. Parts a and b display the 
circuits for finding X and Z errors, respectively. Part c) lists the circuits analyzed 
to determine the encoded error rates for this example. 



analysis since, up to rearrangement of qubits, the output of an extraction becomes 
deterministic as the number of qubits in an encoding approaches infinity. Moreover, 
if p is the probability of an error occurring, requiring the sequential agreement of j 
extractions reduces the probability of misdiagnosing an error on a particular line to 
order jp . Since I ultimately retain only first-order terms I need only consider single 
and double-coupling Steane procedures. 

Single- coupling Steane Procedure 

Two-qubit gates require a controlled interaction between two otherwise isolated quan- 
tum systems. Consequently, they are often the most error prone gates in a universal 
set. In such cases, the factor limiting the probability of successful error correction 
may be the number of times that two-qubit gates must be applied to the data in 
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order to reliably diagnose errors. For Steane's method, this interaction is minimized 
by coupling to the data once per X correction and once per Z correction, as shown 
in Figure 4.2. 

Table 4.2 shows that, relative to the other procedures considered, the single- 
coupling Steane procedure performs most strongly for error model ^^3. This is in 
line with our expectations since model #3 includes only two-qubit gate errors and 
the resultant ancilla errors. Surprisingly, it also does rather well overall, suffering in 
comparison to the double-coupling Steane procedure only for error model 7^2 where 
measurement errors dominate. The single-coupling Steane procedure lacks a means 
of syndrome verification, so any errors in syndrome measurement are transferred 
directly to the data. Nevertheless, my results demonstrate that moderate single-qubit 
and measurement error probabilities can be tolerated when high quality ancillae are 
available. 

Double-coupling Steane Procedure 

When two-qubit gates are relatively reliable, the damage done during the extraction 
of error information can be limited by preparing ancillae such that they include few 
errors capable of propagating to the data. Under these circumstances, it is often 
advantageous to verify error diagnoses by coupling to the data more than once, as 
shown in Figure 4.3. 

For the double- coupling Steane procedure, error model #1 is especially interesting 
because it was chosen in imitation of the error model used by Reichardt [M] in his 
numerical estimation of the threshold for a Steane style procedure on a 49 qubit code. 
My threshold of 0.90% for asymptotic minimum distance decoding is quite close to 
his value of roughly 0.88%. The extraordinary agreement of these two estimates is a 
coincidence, as can be seen from my discussion of finite codes in Section 4.6, but their 
rough equivalence illustrates the value of my idealized algorithm for approximating 
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Figure 4.3: Encoded circuits for the double-coupling Steane procedure. Syndrome 
information is extracted twice, and qubits implicated both times are presumed to 
be in error. Parts a and b display the circuits for finding X and Z errors, respec- 
tively. Part c) lists the circuits analyzed to determine the encoded error rates for 
this example. 



the encoded error rates used in threshold estimation. As the number of qubits 
grows, the threshold estimate obtained by Monte-Carlo simulation approaches that 
predicted my analytical method. In fact. Section 4.6 shows that reasonably good 
bounds can be placed on outcome of threshold estimates even for very small numbers 
of qubits provided that the fault-tolerant procedure is exactly implementable in my 
form. 

As expected, relative to the other two procedures, the double-coupling Steane pro- 
cedure performs most favorably for error model 7^2. Somewhat surprisingly, however, 
it still underperforms the Knill procedure. The reason for this is most easily under- 
stood by considering the limiting case in which only measurement errors occur. In 
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the absence of any other source of error, measurement errors have no effect on either 
the double-couphng Steane or the Knill procedure until their probabihty exceeds r; 
beyond that point both procedures fail with certainty. Thus, the two procedures 
cope with measurement errors equally well, but the Knill procedure handles other 
kinds of gate errors more effectively. 

Error models #3 and #4 demonstrate small gains in the threshold that can 
result when two-qubit gate errors have some underlying structure. Model 7^3 is a 
pure two-qubit-gate depolarizing error model (which includes the associated ancilla 
errors) while #4 is a model in which two-qubit gates malfunction by producing either 
an X or a Z error on either the control or the target. Given the highly restricted 
form of error model #4 it is discouraging that the threshold increases by less than a 
factor of two over that of error model #3. Though, in light of the already high value 
for the threshold in error model #3, it is perhaps unsurprising. 

4.5.2 Knill's Method 

For my implementation of Knill's method, error model #4 achieves the highest 
threshold, though physical systems displaying this sort of error seem unlikely. Er- 
ror model #1 provides another check of my algorithm, since its parameters are also 
roughly those used by Knill [29] in a paper on telecorrection. Setting r to 11% for 
the channel capacity for CSS codes, I find that the threshold for this model is 3.9% 
compared to Knill's estimate of 3% and his extrapolation of up to 5%. The approx- 
imate agreement between these values is satisfying, though an exact match is not 
expected since Knill assumes that errors on up to 19% of the qubits can be corrected, 
an assumption that derives from bounds on the channel capacity for general quantum 
codes [17]. 

Of course the most striking aspect of Table 4.2 is that the Knill procedure yields 
a higher threshold for every error model. As with the single-coupling Steane case. 
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Figure 4.4: Encoded circuits for the Knill procedure. Error correction is performed by 
teleporting the data, minus the errors, using an entangled two-logical-qubit ancilla; 
the precise location of errors is unimportant so long as the result of the logical- 
qubit measurement is correctly decoded. Parts a and b display the circuits for 
correcting errors when the output of the previous step was dominated by Z and 
X errors respectively. Part c) lists the circuits analyzed to determine the encoded 
error rates for this example. Ancilla error distributions are used as the input to these 
circuits since that is the only remaining source of error after a successful teleportation 
correction. 



this derives partly from my assumptions regarding ancillae. In particular, Steane's 
method was designed to utilize ancillae for which either correlated X or correlated 
Z errors could be minimized, but not both, a situation certain to favor his approach. 
A second but lesser objection can be made that I set the ancilla error probabihties 
equal for all gates and all methods, ignoring the fact that some methods, such as 
Knill's, and some gates, such as the T gate, will require more complex ancillae which 
may in turn be more error prone. Substantially more detailed ancilla information 
would be needed to evaluate the importance of this effect, but the overall character 
of my results is unlikely to change since that would entail in excess of a two-fold 
increase in the error probabilities for logical two-qubit ancillae over those for ancillae 
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prepared in a single-qubit logical state. Thus, so long as resource considerations do 
not limit our ability to discard suspect ancillae, and therefore to make very high 
quality ancillae, Knill's method will provide the highest thresholds. 



Prior to taking the limit n — oo, the expression for the probability of an encoded 
error at a location C was 



where t is the number of correctable errors and pc is the probability of a relevant error 
on a single qubit at the location in question. Using this expression, the programme 
of Section 4.2 can be implemented for finite n. In doing so, however, the simplicity 
of the algorithm suffers somewhat, and its interpretation as an idealized threshold 
bound is completely lost. Fundamentally, the complications that arise are all due to 
the fact that the success or failure of various portions of an encoded gate are no longer 
deterministic. This section explains how to deal with the associated difficulties and 
concludes with a brief demonstration of the algorithm for a [[49, 1, 9]] and [[7, 1, 3]] 
code. 

In the examples of Section 4.5, I establish a background error rate by performing 
an initial error correction, but for finite n this initialization is not guaranteed to 
succeed. Though the failure of the initial error correction is properly assigned to 
the previous encoded gate, the residual errors will differ dramatically depending on 
whether it occurred. This presents no problem when only a single level of encoding 
is employed since any encoded failure is considered a failure of the computation. In 
concatenated coding schemes, however, failed encoded qubits are corrected at higher 
levels of encoding. Their continued use is problematic since an encoded gate failure 
may be correlated with subsequent encoded failures. Nevertheless, I recommend 



4.6 Finite Codes 




(4.7) 
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calculating the encoded error rate for finite codes using the assumption that the 
initialization did not fail, a choice that requires no modification to the case for large 
n. 

Likewise, calculation of the single line error rate pc proceeds without modifica- 
tion. For finite codes, however, the maximum tolerable single line error rate becomes 
a nontrivial function of the encoded error rate that we wish to achieve. The proba- 
bility of an unrecoverable error never goes to zero, so it is necessary to perform the 
summation in Equation (4.7) to determine the portion of the encoded error rate due 
to any particular location. 

The possibility of failure must be considered at many points in the circuit since 
statistical fluctuations will produce unrecoverable errors at a variety of locations. 
Typically, encoded failure probabilities at various locations will be strongly corre- 
lated, but the exact nature of these correlations is difficult to predict. Thus, the best 
I can do is to bound the encoded failure probability. 



where S ranges over the locations of every post-initialization output, that is, syn- 
drome measurements and the final state of the data with regard to both X and Z 
errors. 

To clarify the changes outlined above, consider the example of the double-coupling 
Steane procedure implemented using a [[49, 1, 9]] quantum code and subject to the 
error channel defined by error model 7^1. For the encoded '"X gate, the set of 
single line error probabilities corresponding to X errors at the eight locations of 
post-initialization syndrome measurement and X and Z errors at the two output 
locations of the data is 




Encoded 




(4.8) 



error rate 




A7p A3p A3p Alp 39p 37p 37p 



8'8'8' 8'8'8'8' 
33p 9p 9p 3p 3p 



(4.9) 
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Solving Equation (4.8) subject to the restriction that the encoded error rate is exactly 
p yields solutions in the range 

0.0036 >p > 0.0023. (4.10) 

Repeating this process for each of the other encoded gates and taking the minimum 
over the upper and lower bounds produces a threshold of 

0.0034 > pth > 0.0025 (4.11) 

where, of course, the caveats discussed in Section 2.7.1 regarding concatenated 
threshold estimates all apply. This example provides a particularly apt compari- 
son to Reichardt's threshold estimate for the [[49,1,9]] code [1 1]. The threshold 
calculated here is roughly a third of that estimated by Reichardt. The difference 
presumably springs from the superiority of his rule for syndrome extraction when 
n = 49. 

Readers puzzled by the discrepancy between the threshold estimates obtained in 
this chapter and those calculated in Chapter 3 should be relieved to know that the 
equivalent bounds for Steane's [[7, 1, 3]] code are 

0.0013 > pth > 0.0003 (4.12) 

showing once again that, in addition to being approximate, the values of the threshold 
estimated by such methods are highly procedure dependent. As a consistency check, 
encoded *"X error rates obtained for exactly the same error model (excepting ancilla 
errors which are determined by the ancilla construction) and fault-tolerant procedure 
using the Monte-Carlo algorithm from Chapter 3 are plotted in Figure 4.5. From 
the figure, it can be seen that the threshold estimate falls within the range specified 
by Equation (4.12). The notable difference between the estimate of the depolarizing 
threshold here and in the last chapter arises because the error model used to represent 
the depolarizing channel here, chosen for comparison with Reichardt's work, actually 
produces fewer Pauli errors for a given value of p than that in the previous chapter. 



Chapter 4. Thresholds for Homogeneous Ancillae 



123 



P 



0.0005 



0.0015 



0.002 



0.001 




0.00025 



0.0005 



0.00075 



0.001 



0.00125 



0.0015 



P 



Figure 4.5: The encoded X error probability p versus the unencoded gate error 
probabihty p. The data was taken for Steane's code and the double-couphng Steane 
procedure using error model t^I, a rescaled depolarizing channel. For reference, a 
diagonal line demarcating the break-even point for encoding is drawn in red. The 
intersection of this line with the blue curve fitting the data gives a depolarizing 
threshold estimate of pDth = 0.0012. Error bars fit within the dots. 

4.7 Analysis 

The algorithm that I have described for generating thresholds can be viewed in two 
possible lights. First, it might thought of as a way of establishing rigorous bounds on 
the threshold for fault-tolerant quantum computation given the, admittedly elusive, 
resource of ancillae with independent, identically distributed errors. Second, it can 
be considered a fast, flexible method for establishing threshold estimates, yielding 
the estimate that would ultimately be obtained for a method given a large enough 
quantum code and sufficient computer time. 

My approach applies to most fault-tolerant procedures employing CSS codes. It 
relies on the fact that nearly all elements of such a procedure are homogeneous, that 
is, transversal with identical components. Inhomogeneous elements are either elimi- 
nated, as for classical syndrome processing and the application of recovery unitaries. 
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or, in the case of ancillae, replaced with homogeneous equivalents. This allows me 
to calculate the probability of failure for encoded gates in terms of the error proba- 
bilities associated with a single strand of the encoded blocks. In the limit that the 
number of encoding qubits approaches infinity, a criterion for success becomes sim- 
ply that the probability of finding an error never exceed the fraction of the encoded 
qubits on which said error can be corrected. When this is satisfied, it is possible, in 
the limit of infinite block size, to compute indefinitely, and our base error rates are, 
by definition, below threshold. 

The value of considering thresholds for homogeneous ancillae is that they can eas- 
ily be calculated for a variety of fault-tolerant procedures and error models, thereby 
providing a relatively simple metric for comparison. Section 4.5 includes thresh- 
olds for computation for three fault-tolerant procedures and four error models. One 
of the procedures considered is based on a method of telecorrection used by Knill, 
while the other two are variations, in that the number of syndrome extractions is 
fixed, on Steane's approach to achieving fault tolerance. The error models consid- 
ered are a full depolarizing error model, a depolarizing error model with increased 
measurement errors, a depolarizing error model for two-qubit gates exclusively, and 
a restricted two-qubit-gate error model. Holding the total probability of an error 
constant, small improvements are observed in the threshold for certain choices of the 
two-qubit-gate error model. For stochastic errors and CSS codes, improvements of a 
grander scale are unlikely because the threshold coefficient for the depolarizing chan- 
nel using Knill's procedure is already 1/3. For errors that actually reach the data, 
the largest threshold coefficient one would expect is 1 (though 2 might be achieved 
using multiple versions of gates with different highly unusual errors), so at best we 
might look for a factor of 3. Interestingly, I find by inspection that the threshold 
coefficient for measurement errors exclusively, which need not reach the data, is 1 
for the double-coupling Steane and Knill procedures. With regard to comparisons 
between procedures, the single-coupling Steane procedure is shown to outperform 
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the double-coupling procedure when two-qubit depolarizing errors dominate, but 
the double-coupling Steane procedure does notably better when measurement errors 
are likely. I also find that Knill's approach outperforms that of Steane for all error 
models considered, a conclusion that is likely to hold so long as correlated ancillary 
errors are rare and the ancillae needed for Knill's method are not appreciably more 
error prone than those employed by Steane. 

Idealized thresholds aside, my algorithm is useful as a means of approximately 
computing the logical error rate for a single level of encoding, which is an established 
method of estimating the threshold for quantum computation. The two treatments 
yield similar outcomes because numerical estimates of the encoded error rate typi- 
cally prepare ancillae in a way that maximizes their quality at the cost of additional 
resource overhead. Ancillae prepared in this manner have error distributions approx- 
imating my ideal of independent, identically distributed errors. The basic algorithm 
uses the infinite limit to obtain simple analytic results, but an alternative (and less 
rigorous) algorithm for finite codes is described in Section 4.6. Both methods were 
shown to yield results in rough accordance with the depolarizing threshold deter- 
mined by Reichardt [ ] for the [[49, 1,9]] code, and the finite version was found to 
yield reasonable bounds even for the case of Steane's [[7, 1, 3]] code. For telecorrec- 
tion, in the limit n oo, my estimate of the depolarizing threshold was consistent 
with the range of values determined by Knill [29] . 

Much further work remains to be done on this subject. One topic of interest 
is the rate of convergence of my threshold estimate ranges for finite codes with the 
estimates obtained in the large n limit. A second possibility is the extension of my 
analysis to include memory errors, which promises to be a straightforward, if unbeau- 
tiful, endeavor. The most valuable addition, however, would be to explicitly define 
methods of ancilla construction and determine the degree to which they differ from 
my ideal. Constructing ancillae to my specifications is an extremely difficult prob- 
lem, but one whose solution would have a strong impact on the theory of quantum 
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computing in general and this work in particular. A scalable method for producing 
ancillae with independent, identically distributed errors would enable the algorithm 
presented here to be employed for the calculation of rigorous lower bounds on the 
threshold without any caveats about idealized resources. Which is why I take up 
that problem in the following chapter. 
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Chapter 5 



Ancilla Construction 



In Chapter 3 I found little advantage to having a detailed knowledge of the Pauli error 
channel afflicting a system. This was due to a pair of facts. First, the error overhead 
due to the application of transversal gates is only marginally affected by the particular 
errors that happen to be generated by the gates. Second, ancillae prepared using 
a very liberal discard policy already have such low error rates that they have little 
effect on the threshold at all, so improving them is of no consequence. The second 
observation led me to investigate the threshold separately from ancilla construction 
in Chapter 4 where I found that large ancillae with homogeneous error distributions 
were indeed a valuable resource, sufficient to permit quantum computation at error 
rates on the order of a few percent. The construction of large, high-quality ancillae, 
however, is a difficult matter. One that I now turn to. 
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5.1 Graph States 

A graph state [15] is a stabilizer state for which there exists a set of stabihzer gen- 
erators, Sg = {A^, such that 

A^ = X, II Zk (5.1) 

where A/'(j) denotes the neighbors of node j on some graph. Using the binary 
representation of the Pauh group, such a set of generators has the form 



I B 



(5.2) 



where B is the adjacency matrix of the graph and, thus, a symmetric matrix with 
O's on the diagonal 

From this choice of stabihzer generators derives a particularly elegant preparation 
procedure for the associated state. The quantum circuit 

n %fcf^®"|0)^" (5.3) 

j<k\jeAf{k) 

prepares the n-qubit graph state associated with the graph described by A/". The 
origin of this circuit is most easily understood by considering the effect that the 
unitaries have on {Zh}h^i, the stabilizer generator of |0)^". Employing a few of the 
identities from Section 2.4, each stabilizer can be shown to transform to a new 
stabilizer 

nCy TT®n rv Tj®n I I Crv 

j<k\j€Af{k) l<m\leAf{in) 

(5.4) 

j<k\j(aM(k) l<m\l&M{m) keAf{h) 

which corresponds exactly to a stabilizer for the desired graph state, hence, the 
unitary prepares the graph state. Figure 5.1 provides an example of each of these 
representations of a graph state. 
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Figure 5.1: A a) graph, b) stabilizer generator, c) binary stabilizer generator, and d) 
preparation circuit for an example 6-qubit graph state. Nodes in a) are numbered 
sequentially starting from the upper left and moving clockwise. 



Much of the interest with regard to graph states in the literature has been focused 
on their suitability as a substrate for measurement based quantum computation [41]. 
It has been shown, for a variety of classes of graphs, that single qubit measurements 
are sufficient to implement an arbitrary quantum computation. For the purposes of 
this chapter, however, I am more interested in the fact, proven by Van den Nest [15], 
that, up to the application of local Clifford gates, the class of stabilizer states is 
equivalent to the class of graph states. I include my own proof of this below. 

Definition 1. For any stabilizer generator Sg = {A*}, let the term gap form refer 
to any column j such that A^^ = I for all i > j ■ 

Definition 2. For any stabilizer generator Sg = {A"^}, let the term pivot form refer 
to any column j such that A^^ G {X, Y} and A^^ G {/, Z} for all i ^ j ■ 

Definition 3. For any stabilizer generator Sg = {A^}, let the term echelon form 
refer to any column j in either gap or pivot form. 
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Lemma 4. Given an m-element stabilizer generator Sg = {A^} such that the first 
k — 1 < m columns are in echelon form, we can construct a stabilizer generator Sg 
such that the first k columns of Sg have echelon form and S and S" are equivalent 
up to conjugation by local Clifford gates. 

Either the kth column of S satisfies A'^'^ = I for all i > k, showing that it is aheady 
in echelon form, or there exists a.n i > k such that A*'^ ^ I. In the second case, 
by exchanging this row i with row k and conjugating by Hk as necessary, we can 
transform to a stabilizer generator Sg = {B^} such that -B^*^ G {X, Y}. Multiplying 
any other row (generator) of S' by preserves echelon form on the first k — 1 
columns since, for all j < k, either B^^ = / or B''^ = Z and column j is in pivot 
form, which is impervious to Z Pauli operators. Consequently, multiplying by B'^ 
each other row B' such that B^'' G {X, Y} yields a new stabilizer Sg whose first k 
columns are in echelon form. 

Theorem 2. Given an n-qubit, n-element stabilizer generator Sg = {A^}, we can 
construct a graph-form stabilizer generator Sg such that S and S" are equivalent up 
to conjugation by local Clifford gates. 

Applying Lemma 4 n times yields a new stabilizer in echelon form Sg = {B^}. 
Let j be the index of the last gap column of Sg, by definition B^^ = I for any i 
indexing a gap column. By assumption, row j is independent, so there exists an i 
such that -B-'* = Z, but this implies that row j and row i anti-commute since row i 
has an X or y at position i and Z's every else that row j might have Z's. Thus, 
by contradiction, we know that there are no gap columns and so every column in 
Sg has pivot form. Similarly, commutativity requires that the Z's be symmetrically 
distributed since if B^^ ^ B^'^ then rows i and j anti-commute. Consequently, Sg 
differs from graph form only in the possible presence of y s on the diagonal or signs 
on generators. Conjugating by P and Z appropriately remove these offending traits 
yielding a graph-form stabilizer Sg. 
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Figure 5.2: Illustrations of the transformations involved in the constructive proof 
of the local-Clifford equivalence of stabilizer states and graph states. The examples 
given are for a) a full-rank 5-qubit stabilizer and b) a non-commuting set of Pauli 
operators. In each sequence, the third step and beyond are in echelon form. 
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5.2 Compressed Graph-state Construction 

Due to the phenomenon of memory errors, it is usually a good idea to design circuits 
such that they take as few time steps as possible. While I neglect such errors in 
this work, a more compact form of the circuit for graph-state construction is also 
desirable for the development of a fault-tolerant version of the process. 

Using the circuit from Equation 5.3, the construction time for large graph states 
is almost completely determined by r, the number of time steps spent applying ^Z 
gates. Consequently, compression of the circuit can be approached as a matter of 
rearranging '"Z gates to achieve the maximum amount of parallelism, a task made 
possible by the fact that '"Z gates commute with each other. Since qubits cannot (I 
assume) participate in more than one gate at a time, r is bounded below by 2e/n 
where e is the number of '"Z gates in the construction circuit or, equivalently, the 
number of edges in the corresponding graph. Additionally, r < e since one cannot 
possibly do worse than applying one gate per time step. 

Finding the ideal configuration for an arbitrary selection of '"Z gates is an ex- 
cessively ambitious problem, but we can get an idea of the number of time steps 
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required by bounding r in the worst-case scenario. For the problem of compressing 
graph-state circuits, the worst case scenario is that of the complete graph, where 
each node is connected to every other node. The construction of any other graph 
state can be performed in equal or fewer time steps by omitting the appropriate '"Z 
gates from the construction circuit for the graph state corresponding to the complete 
graph. The number of edges in the complete graph is (2) , so we begin knowing that 
n — 1 < r < n{n — l)/2. 

In order to construct the graph state corresponding to the complete graph, a '"Z 
gate must be applied between every pair of qubits. If we think of the qubits as being 
ordered on a ring, this can be accomplished utilizing any pattern of simultaneously 
applicable gates such that for any distance d < [n/2\ (distances d and n — d are 
the same for a ring) exactly one '~^Z connects a pair of qubits separated by distance d. 
Rotating the pattern through one cycle produces all possible pairings exactly once. 
For n = 2a + 1, a workable gate pattern is 

L(a+1)/2J V(a-l)/2\ 

(5.5) 

i=i j=i 

For n = 2a + 2 this pattern does not quite work since it excludes '"Z gates connecting 
qubits at distance a+1 from one another. In order to establish these connections, 
the gate pattern 

a+1 

W'Zi.a+l+i (5.6) 
i=l 

must be applied using one additional round. A compact construction circuit for the 
complete graph on 5 qubits is given in Figure 5.3. 

The circuits just described provide a method of constructing any graph state such 
that all ^Z gates are applied in a number of rounds r < 2 [ri/2j + 1 . For the complete 
graph, these circuits are nearly optimal, and, for typical graphs, which have half as 
many connections, they are roughly within a factor of two of the lower bound on 
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Figure 5.3: a) The complete graph on 5 qubits and b) a compact construction circuit 
of the kind described in the text for the associated state. 



the number of required time steps. For certain interesting classes of graph states, 
however, the constructive procedure given requires a number of time steps far in 
excess of the lower bound. One such case is that of graph states with small, fixed 
generator weight, the graph state equivalent of low-density-parity-check codes [33]. 

For a graph state with generators of weight w, the total number of edges in the 
corresponding graph is e = (w — l)n/2, yielding a lower bound on the number of 
required rounds of r > w — 1. For w <^ n, a. significant gap exists between this bound 
and the number of time steps required by my explicit circuit. The number of rounds 
required is actually at most 2w — 3, as can be seen from the following argument. 

Imagine designing a circuit for constructing a graph state with weight w genera- 
tors by sequentially inserting the necessary '"Z gates into one of 2 (w — 2) + 1 rounds 
of application. It might be impossible to insert a gate of the required set '"Zij into 
any particular round due to qubit i and/or j participating in some other (previously 
inserted) gate. Qubits i and j participate m. w — 1 ^Z gates apiece, however, 
so they may be previously engaged for at most 2{w — 2) rounds. Thus, it is always 
possible to insert the additional '"Z gate into one of the 2(w — 2) + 1 rounds, showing 
that r < 2(w - 2) + 1 = 2w - 3. 
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5.3 Fault-tolerant graph state construction 

The goal of this chapter is to develop a method of constructing large ancillae that 
does not depend on concatenated coding and does not require discarding large num- 
bers of attempts. This is a significant impediment to ancilla construction since, as 
discussed in Section 2.6.5, the primary advantages of constructing ancillae over ap- 
plying operations directly to the data are low starting error rates, exact knowledge 
of what the state ought to be, and the ability discard failed construction attempts. 
My decision to design routines for graph states was based on the hope that their 
simple structure would allow something to be made of the second feature. 



5.3.1 Tracking Errors 

Consider the compact method of n-qubit graph-state construction described in the 
previous section. Let i?* label the pattern of '"Z gates applied in round i of the r 
rounds of '"Z gate apphcation. The circuit for graph state construction can then be 
written as 

1 1 

JjM®"|0)^" = JJi?'^|0), (5.7) 

i=r i=r 

where I am abusing my notation somewhat by allowing the bar overtop to indicate 
states and gates for a collection of unencoded qubits, or, if you prefer, a very boring 
[[r;,,n, 1]] code. Since '"Z gates commute with the control of any controlled-operation, 
this circuit is equivalent to 

^l,r-,2fl^wR' n '^l,MOr^'^"\ (5.8) 
i=r j=r+2 

where I have supplemented the primary ancilla qubits needed for graph-state con- 
struction by a host of secondary ancilla qubits. An example of this circuit for n = 5 
is given in Figure 5.4. 
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Figure 5.4: An example of the collective circuit for graph state preparation with 
error tracking. The circuit shown prepares a graph state on 5-qubits. In order to 
track errors properly, the portion labeled CATS must be prepared separately in a 
way such that correlated errors are rare. 



The purpose behind adding all of those gates is that while Equation (5.8) and 
Equation (5.7) produce the same output in the absence of errors, their action in 
the presence of errors is very different. The extra gates in Equation (5.8) copy 
out X errors at various points in time to the auxiliary ancilla qubits. If the initial 
portion of the circuit, corresponding to the production of n (r + 2)-qubit cat states, 
is implemented without error, then the latter portion serves to track the location of 
X errors on the primary ancilla qubits during the construction. Both the spatial and 
temporal localization of X errors is important since gates convert X errors into 
X ^ Z errors. Contrariwise, Z errors are not tested for at all, reflecting the fact that 
Z errors do not spread. 

There is no hope of making this circuit fault-tolerant in the strict sense defined 
in Section 2.6 since a single failure on any ^'Z gate can generate a pair of Z errors on 
the output state. At best, I can achieve a weaker form of fault-tolerance where rather 
than preventing the spread of errors I limit it to some fixed amount, optimally, to one 
additional location. To achieve this, I need two things: cat states without correlated 
errors and a method of predicting the location of X and Z errors on the primary 
ancilla qubits given the tracking information obtained by measuring the secondary 
ancilla qubits. I do not discuss a means of preparing suitable cat states since I have 
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Figure 5.5: Examples of a) a perfect error track and b) the sort of error track that 
is actually observed. The examples are for 30 primary qubits where the qubits are 
arranged vertically and time runs from left to right. Black indicates an X error. 
Noisy tracks can be obtained by layering from left to right the results of measuring 
the secondary ancilla qubits coupled to after each round of graph-state construction. 



not yet managed to design one. In what follows, I introduce a method of interpreting 
the measurement record and discuss its properties. 



5.3.2 Interpreting Error Tracks 

A typical X error during graph-state construction presents many opportunities for 
its detection. X errors persist unless canceled by other X errors, so a clairvoyant 
view of X errors on the primary ancilla qubits at a sequence of times would tend to 
reveal a collection of streaks, like those illustrated in Figure 5.5a. A noisy version of 
this insight, as in Figure 5.5b can be obtained by measuring the tracking qubits and 
ordering the results by the primary qubit coupled to and the time of coupling. 

Observed error tracks include noise deriving from errors in both cat-state prepa- 
ration and the measurement process which should be ignored when determining the 
location of errors on the primary ancilla qubits. These inconsequential errors are 
generally recognizable in the measurement record by their failure to persist from one 
measurement to another, lending the noise in error tracks an appearance reminiscent 
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of static on an analog television. To accurately infer the locations of errors on the 
constructed graph-state, this static should be filtered from the error tracks. Below 
I develop filtering rules based on the order of various events, that is, the number of 
errors required to make them happen. Whenever two possible events have different 
orders I assume that the one requiring fewer errors actually occurred. 

In brief, the basic algorithm I employ to filter the noise from a set of error tracks 
is as follows. The error track for each primary qubit is filtered separately. The 
track is divided up into segments separated by two or more measurements which 
indicate that no error occurred. Each segment is then taken to be entirely in error 
if I — w + a<w where / and w are the length and Hamming weight of the segment 
and a is 1 if the segment includes the end of the error track and 2 otherwise. This 
algorithm, which I refer to henceforth as the liberal filter, was used to filter the error 
track in Figure 5.5b to obtain Figure 5.5a. Figure 5.6 illustrates its application for 
some informative examples. 

To explain the origin of the liberal filtering routine, it is easiest to adopt a sim- 
plified terminology. In what follows, "track" always refers to the error track (corre- 
sponding to a single primary qubit) being filtered, "black" and "white" describe the 
state of the primary qubit or the outcome of a measurement with black correspond- 
ing to an X error and white to no X error, and the term "interim" is used to denote 
errors that happen between measurements. 

Schematically, the filtering problem is this: An initially white qubit undergoes 
a sequence of r color measurements. An error during a measurement can cause the 
wrong result to be reported and/or invert the color, but an error during the interim 
between measurements can only change the color. There is a minimum number of 
errors that must occur for the observed measurement results to be possible. Our task 
is to find any sequence of colors that can be generated and made consistent with the 
measurement results using the minimal number of errors. 
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At its core, my filtering algorithm relies on an observation about the reliability of 
sequential pairs of measurements. A second order event is necessary for two agreeing 
sequential measurements to be wrong, but at most two interim errors are required 
to produce the same measurement results without any errors in measurement having 
occurred. In the worst case, the pair of measurements disagree with the known color 
on either side, but this can result from two interim errors, namely, inverting the 
color before and after the measurements. Consequently, it is always reasonable to 
assume that a sequence of agreeing measurement results are accurate. My filtering 
algorithm takes advantage of this fact by partitioning each track into segments where 
the boundaries of segments are delineated by pairs of white measurements. 

Partitioning the track simplifies the problem of filtering in two ways. First, we 
may assume that the incoming color is white and, unless the segment includes the 
end of the track, likewise for the outgoing color. Second, the minimal-error scenarios 
consistent with the observed measurements include one of the two following cases: 
the entire segment is black and all the white measurements were wrong or the entire 
segment is white and all the black measurements were wrong. To see why this is so, 
imagine that some portion of the segment is black. The nearest white measurement 
(in the segment) may be interpreted either as signaling that an error has occurred 
bringing the black region to end or, since every white measurement is bracketed by 
a pair of black measurements, that the white measurement was wrong. Thus, any 
scenario involving a portion of the segment being black requires at least as many 
errors as the scenario in which the entire segment is black. The only other scenario 
that must be considered, then, is the one in which the entire segment is white. 

It is straightforward to compare the number of errors required for an entire seg- 
ment to be black versus white. If the entire segment is white, then all of the black 
measurements must have been wrong, so the number of errors is w where, confus- 
ingly, w is the weight of the segment or the number of black errors. If the entire 
segment is black then all I — w white measurements must have been wrong where I 
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(b) HI! 

(c) ■ ■ ■ ■ 

Figure 5.6: Three examples of my standard noise filtering routine for error tracks. 
Each shows the error track initially obtained, the segments considered, and the post- 
filtration track. The filter breaks the track corresponding to each primary qubit into 
segments demarcated by sequential pairs of measurements indicating no error. The 
segments are then judged error-free ifl — w + a>w and entirely in-error otherwise; 
I, w, and a are the length, weight, and number of endpoints (discounting the end of 
the track) of the segment in question. 



is the length of the segment. Additionally, if the entire segment is black then 1 error 
must have occurred to change the color to black at the beginning of the segment 
and, unless the segment contains the end of the track, 1 error must have occurred to 
change the color back to white at the end of the segment. Consequently, I assume 
the entire segment is black when I — w + a < w, where a is 1 if the segment includes 
the end of the track and 2 otherwise. 

After having filtered the tracking data, I translate the filtered error tracks into an 
expected error distribution on the prepared graph state using a simple set of rules. 
First, streaks that include the final round are taken to indicate the presence of an 
X error on the final state. Second, a gate is assumed to have spread a Z error if 
it was applied to a qubit at a time spanned by one of the qubit 's streaks. Third '"Z 
gates applied just before a streak are assumed to have spread Z errors, but '"Z gates 
applied just after one are assumed not to have. Since Z errors do not themselves 
spread, these rules suffice to predict the final error distribution. 



5.3.3 Error spread 



The liberal filtering algorithm was designed to ensure that one of the most probable 
error scenarios consistent with the observed measurements was adopted. This is 
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(a) 



(b) 



Figure 5.7: Circuit fragments for analyzing the fault-tolerance of my graph-state con- 
struction procedure when no track filter is used. Modulo errors, each measurement 
yields the value 1 since the qubits coupled by '"X gates are initially prepared in cat 
states. The fragment b) applies only to the last round of error tracking where the 
measurement is used to determine whether an X error is on the primary qubit. All 
other rounds are described by a) and use the measurements to determine whether Z 
errors were spread in either direction. 



no guarantee, however, that the presumed error actually occurred. To understand 
the fault-tolerance properties of the liberal filter, it is necessary to determine how 
different the error distribution of the output state might be for other error scenarios 
capable of generating the observed measurement results. 

I begin my analysis of error spread by considering the result of interpreting error 
tracks using no filter at all, a case I refer to henceforth as the fool's filter. The 
tremendous advantage of this filter is that it is very easy to analyze its fault-tolerance 
properties. Each measurement outcome is utilized individually and applied to the 
locality where it was obtained, so it is only necessary to consider small pieces of the 
construction circuit. In fact, it is sufficient to analyze the two circuits in Figure 5.7. 

The circuit fragment in Figure 5.7a depicts the operations applied in every round 
of error tracking but the last, which is shown in Figure 5.7b. For each fragment, in 
the absence of errors, measurements yield the outcome 1, while a single X error on a 
primary ancilla qubit propagates to a measurement and is detected as an outcome of 
— 1. Whenever a measurement outcome of —1 is obtained in the circuit in Figure 5.7a 
a Z error is assumed to have been spread to the opposing primary qubit. A Z error 
on either of the secondary ancilla qubits is spread to one of the primary ancilla qubits. 
An X error on either of the secondary ancilla qubits results in a mistaken inference 
(an effective Z error) regarding the opposite primary qubit. 
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Independent of whether there was initially an X error on either of the primary 
qubits, the circuit in Figure 5.7a correctly determines which primary locations Z 
errors were spread to so long as neither the secondary ancilla qubits nor the gates 
introduced additional errors. When one or more other errors occurs, the errors, 
together with diagnosis, may sometimes result in a Z error on each primary qubit in 
the circuit fragment. X errors are irrelevant since they do not impede the remainder 
of the construction circuit and are tested for separately in the final round. Thus, the 
maximum scale-up per error for this circuit fragment is 2. The circuit in Figure 5.7b 
detects an X error on the primary ancilla qubit so long as neither the secondary 
ancilla qubit nor the gates introduced additional errors. When one or more other 
errors occurs, a Y error may result on the primary qubit. Counting X and Z errors 
individually, the maximum scale- up per error is again 2. 

If the error spread of the liberal filtering routine is to be greater than 2, it must 
be for sequences of measurements where it acts differently than the fool's. Thus, 
we need only concern ourselves with segments delineated by pairs of measurements 
indicating no error, excluding those segments of length greater than 2 where all 
measurements are in agreement. Recall that the restriction, using the liberal filter, 
for deciding whether a segment is in error is {l — w) + 2 < w. Suppose that the liberal 
filter completely misidentifies a collection of measurement errors as representing an 
error of length / on the primary qubits. At most this may result in I + 1 Z errors 
{X errors are irrelevant for segments that do not include the end of a track), but, 
in order for the algorithm to have reached this conclusion, it must be the case that 
w > {I + 2)/2, implying that 2w > / + 1, i.e., twice the number of errors is greater 
than the number of errors generated. Now suppose that the liberal filter completely 
misidentifies an error of length I and a collection of measurement errors as simply 
the result of measurement errors. At most, this may result in I + 2 Z errors, but, 
in order for the algorithm to have reached this conclusion, it must be the case that 
l-w + 2>w, implying that 2{l-w + 2) = 2{l + 2) - 2w > 2{l + 2) - {I + 2) = I + 2, 
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i.e., twice the number of errors is greater than or equal to the number of errors 
generated. For the case of a segment including the end of the track, the relevant 
inequality, number of errors generated on the primary ancilla qubits, and number 
of errors that occurred are {I — w) + 1 < w, / + 1, and w for the first case and 
{I — w) + 1 > w , I + 1, and / — w + 1 for the second. The results are the same. 

In conclusion, neither filtering algorithm yields a fault-tolerant construction rou- 
tine since they both permit the generation of correlated errors. The preceding para- 
graphs show, however, that they do limit the spread of errors to 2 per original failure. 



5.4 Numerical Investigations 



5.4.1 Filtering with the Viterbi Algorithm 



The liberal filter interprets the measurement results from a single track as indicating 
an error of the lowest order possible, but, among errors of that order, the particular 
one chosen is rather arbitrary. It is natural to wonder how this filter compares to an 
idealized filter that always yields the most probable error scenario consistent with 
the observed measurements for the entire set of error tracks. Unfortunately, imple- 
menting an ideal filter as a brute-force maximum-likelihood decoder is impractical 
because the number of possible error scenarios grows exponentially in both the num- 
ber of qubits and the number of rounds. It is possible, however, to efficiently find 
the most probable sequence of X error states corresponding to the measurements 
from a single track. The exponential scaling in the number of rounds can be avoided 
by using the Viterbi algorithm to determine the most probable sequence of states. 
The Viterbi algorithm, which is explained in detail in Appendix C, is a method for 
efficiently finding the maximum likelihood path for problems on directed graphs. In 
this section I apply it to the problem of filtering error tracks. 
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Adapting a problem to the Viterbi algorithm is entirely a matter of writing down 
the appropriate graph. The graph must be directed and acyclic, the transition and 
starting probabilities must be known, and, practically speaking, the number of si- 
multaneously relevant states must be manageable. Graphs describing time-ordered 
sequences are easily made to satisfy the first criterion by requiring distinct nodes 
for each state at each time. The states that we care about for the filtering problem 
are the presence or absence, at each time, of an X error on the primary qubit cor- 
responding to the track being filtered. The transition probabilities from one state 
to another follow very simply from the error propagation rules for X and the prob- 
abilities of various gate errors, but only if we additionally include the X-error state 
of each qubit that might transmit an X error to the qubit of interest. It is possible 
to do this without violating the injunction against unmanageable numbers of states 
since the X state of at most one additional qubit is relevant to the evolution of the 
primary qubit at any particular time and each additional qubit interacts only with 
the primary qubit. 

A segment of the resulting directed (time runs to the right) graph is shown 
in Figure 5.8. The graph in Figure 5.8 depicts the allowed state transitions for 
any but the final round of tracking graph-state preparation; in the final round, the 
'"Z transitions are omitted. The graph for multiple rounds can be constructed by 
appending graph segments to each other, merging the hollow nodes on the right of 
each segment with the leftmost solid nodes of the next. The secondary ancilla qubit 
referred to by the second state label changes from one segment to the another, but 
the transitions labeled "Include secondary qubit" erase any record of the previous 
secondary qubit. 

Starting probabilities for the first round of graph-state construction are given at 
the left in Figure 5.8. At the end of each round, paths terminating on a measure- 
ment outcome different from that which was observed are deleted, and the remaining 
unnormalized probabilities are fed into the subsequent round. The transition prob- 
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abilities h, a, 6, c, d, and / denote the probability of generating an X error during 
cat state preparation, on the relevant end of the '"Z gate, on the control end of the 
'"X gate, on the target end of the '"X gate, on both ends of the '"X gate, and during 
measurement. Thus, 

h =Ppx + Ppy 

+ qxY + gyx 

& =Px/ + Pyi + Pxz + Pyz ^^-^^ 
C =P/x + P/y + Pzx + Pzy 

=Pxx + PxY +Pyx+P YY 
f =Pm 

where, as in Chapter 4, pp^, Pah, Qas, and Pm denote the probabilities of preparation, 
'"X gate, '"Z gate, and measurement errors of the kinds indicated by T and AH where 
r ranges over the single-qubit Pauli errors and AS ranges over the two-qubit Pauli 
errors. 

The output of running the Viterbi algorithm on the graph just described is the 
most probable sequence of X error states for the primary qubit given the observed 
data. The locations of Z errors on the prepared graph state are then inferred using 
error propagation and, for indicted '"Z gates, by assuming the most probable failure 
mode consistent with an X error being generated. 

5.4.2 Code and Results 

To check my analytical results and to collect more detailed error information I wrote 
yet another simulation. The code implements the error filters described^ as well 

^The code also implements a conservative filter that I do not discuss because it is only 
marginally different from the liberal filter. 
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Filter 


Fool's 


Liberal 


Viterbi 


Average scale-up 


0.912 


0.499 


0.498 


Max scale-up 


1.5 


1.25 


1.21 



Table 5.1: The average scale-up and the maximum observed scale-up for the fool's, 
liberal, and Viterbi filters. In each case the average number of failures per construc- 
tion was approximately 25. 

as a Monte-Carlo error generation and propagation code complete with functions 
for performing tracking graph-state construction. The cat states necessary for the 
simulation are assumed to be prepared to specifications elsewhere. 

Using this code, I collected data on the error composition resulting from tracking 
preparation of the graph-state corresponding to the complete graph on 200 nodes. 
The total failure probability for each operation was set to 1/4000, and a depolarizing 
error model was employed for all operations except measurements, which produced 
only bit errors. Data was collected for 100000 runs using each of the three filters. A 
normalized histogram of the number of Z errors remaining after preparation is shown 
in Figure 5.9a, and a normalized histogram of the number of X errors remaining 
after preparation is shown (with the no-error column omitted) in Figure 5.9b. Some 
additional statistics are displayed in Table 5.1. 

All filtering algorithms perform substantially better than the maximum possible 
scale-up of 2 predicted for the fool's and liberal filter in Section 5.3.3. As interpreted 
using the fool's filter, graph states prepared with tracking have, on average, a number 
of errors equal to the number of failures that occurred during the preparation. The 
liberal and Viterbi filters both yield graph states with about 50% as many errors as 
actually occurred during construction. These two filters are identical to within the 
margin of error, indicating that the liberal filter is well suited to filtering depolarizing 
errors. 
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Figure 5.9: Histograms showing the percent probabiUty of various numbers of unde- 
tected a) Z and b) X errors surviving the tracking graph-state construction process 
for n = 200 and pdep = 1/4000. Yellow, blue, and green bars identify data for the 
fool's, liberal, and Viterbi filters respectively. On average roughly 25 failures occur 
during tracking construction, so the error scale-up is typically quite modest. 



Chapter 5. Ancilla Construction 



148 



5.5 Scaling 

The purpose of developing a new method of ancilla construction was to permit the 
production of large ancillae with improved overhead costs in terms of qubits, so it is 
important to consider the scaling properties my design. 

First, it must be noted that the approach presented in this chapter is not ap- 
plicable to graph states of arbitrarily large size. The direct approach to ancilla 
construction tends to fail for two reasons. Typically, the limiting factor is the prop- 
agation of errors between qubits during the construction process, a pitfall which I 
have made every effort to avoid. As the number of applied gates grows, however, 
sheer accumulation of uncorrected, independent errors will eventually cause the con- 
struction process to fail. I have made no attempt at correcting uncorrelated errors 
(though incidentally I have done so for X errors), so this effect bounds the size of 
the ancillae which I might produce. 

In each round of tracking graph-state construction, there are four sources from 
which Z errors might be injected into each primary qubit. When preparing the graph 
state corresponding to the complete graph, the total opportunities for each primary 
qubit to suffer a Z error is thus roughly 4n. If the construction is to succeed, however, 
it must be the case that the expected number of Z errors per qubit is much less than 
1. Taking all gates and ancilla qubits to fail with probability p yields the bound 



1 



(5.10) 



Anp <^ 1 




on the size of the complete graph. 



As mentioned in Section 5.2, however, graph states corresponding to graphs with 
{w — 1) connections per node can be constructed using only 2w — 3 rounds. In such 
a construction, the number of opportunities for a primary qubit to suffer a Z error 
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is only about 8w, yielding the bound 



1 



(5.11) 



8wp < 1 




on the weight of the stabilizer generators of the graph state. 

In terms of qubit resources, tracking construction of the complete graph requires 
n roughly n-qubit cat states. By comparison, measuring a single weight n generator 
using Shor's method of syndrome extraction also requires an n-qubit cat state, but 
the measurement must be repeated t + 1 times where t is the number of errors that 
we wish to be able to tolerate without failure. Thus, verifying the complete graph 
state by Shor's method after it has been constructed requires roughly nt n-qubit cat 
states. Similarly, for graph states with weight w generators, tracking construction 
requires n roughly 2'u;-qubit cat states while verification via Shor's method requires 
roughly Mm{nt,nw/2) w-qubit cat states. Generally, therefore, the total number 
of qubits prepared in various cat states will be smaller (often much smaller) for my 
procedure. Shor's method, however, is capable of tolerating cat states with much 
higher frequencies of Z errors, so a fair comparison would require that I include the 
number of qubits needed to make each kind of cat state, a quantity which I do not 
presently know. 



Much work remains to be done on this topic. The construction procedure I have 
developed displays a number of interesting properties: it requires relatively few cat 
states for its implementation, responds very differently to different two-qubit error 
models, constructs and verifies graph states without ever having measured any of 
their stabilizers, and generates states with an enormous asymmetry in the number 
of Z and X errors. On the other hand, it requires higher quality cat states than are 
typically necessary and generates states with weight 2 correlated errors. 



5.6 Analysis 
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The asymmetry between X and Z errors is a especially intriguing property. 
Steane's method of syndrome extraction makes good use of ancillae with exactly 
this sort of asymmetry. Non-trivial graph states cannot also be CSS codes, so this 
particular avenue is closed to me, but a variety of purification strategies are promis- 
ing. The graph state corresponding to the complete graph on an even number of 
nodes, for example, is invariant under the transversal application of H, and can thus 
be prepared with either minimal X or minimal Z errors. Such states bring to mind 
the work of Glancy, Knill, and Vasconcelos [21] who have identified a [[6, 2, 2]] code 
that can correct any single X or Z error so long as it is known which half of the code 
it occurred in. 

The presence of weight 2 correlated errors on the constructed ancillae is a definite 
drawback. Such errors effectively reduce the order of the worst-case correctable 
error on an encoded state by half. Ideally, then, ancillae constructed by the method 
described in this chapter would either be further purified or used in some specialized 
task like Steane-style syndrome extraction. 

In one very pertinent special case, however, further verification might be unnec- 
essary. The graphs associated with CSS code states are bipartite, meaning that the 
graph can be divided into two sets of nodes such that no nodes in the same set are 
connected. To recover a CSS code state from the graph state it is only necessary to 
apply H to the qubits corresponding to all of the nodes in one set. The correlated 
errors left behind by the fool's filter, however, only occur on qubits connected by a 
'"Z gate, and those are only (effectively) Z ® Z errors. Thus, applying H to all of 
the qubits corresponding to one set of the bipartite graph yields correlated errors 
only of the form Z ® X. So long as the phase gate is applied by teleportation, this 
separation is maintained throughout encoded Clifford operations. In such a case, the 
correlations can be ignored since they have no effect on CSS code error correction, 
which separately corrects X and Z errors. 

Finally, the construction of adequate cat states has turned out to be an unex- 
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pectedly troublesome problem. For large cat states, the standard approach involves 
making many pairwise parity measurements, during which the Z error prob- 

ability only builds. For my procedure, n-qubit cat states thus produced must be 
corrected for Z errors when np ^ 1, but effective correction of Z errors in an n- 
qubit cat state is impossible unless np <^ 1. Without a novel technique for creating 
cat states, it is difficult to see how this problem might be resolved, and, without a 
solution, the window of probabilities and qubit numbers for my procedure is small. 
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Chapter 6 



Conclusion 



The primary conclusions of this dissertation are twofold. First, I have found that 
a detailed knowledge of the kinds of errors produced by gate failures is, with one 
possible exception, not particularly useful. Second, I have shown that large ancillae 
prepared in logical basis states are a sufficient resource to permit computation at 
quite high rates of error. Also of interest are the following observations: (i) some 
restricted error models yield small gains in the threshold without any modifications 
to the standard approach to fault tolerance, (ii) improvements in ancilla preparation 
are of little consequence to threshold estimates, (iii) Knill's fault-tolerant procedure 
outperforms that of Steane given ancillae with uncorrelated errors, and (iv) n-qubit 
graph-state construction can be compressed to roughly n time steps. In addition 
to these findings, I develop a general tool for understanding thresholds and a novel 
technique for generating ancillae, an important resource for quantum computation. 

My conclusions regarding unusual error models are based on the results of Chap- 
ters 3 and 4. In Chapter 3, I investigate the impact of knowledge about the error 
model by tailoring a fault-tolerant procedure to a highly structured stochastic error 
channel, namely, symmetric errors. Through bounds and estimates I then ex- 
amine the threshold for quantum computation using this procedure. Comparing my 
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results with threshold estimates for the depolarizing channel and threshold bounds 
for adversarial errors, I find only a small increase in the threshold for my tailored 
procedure and error model of choice. Moreover, I show numerically that the window 
of error models for which the procedure yields an advantage is quite small; adding a 
depolarizing channel at 1/10 the strength of the symmetric errors completely dis- 
rupts my procedure's function. Oddly, such small gains come in spite of the fact that 
my tailored procedure dramatically reduces the frequency of errors on constructed 
ancillae. I resolve this mystery by estimating the threshold given perfect ancillae. 
For the Steane code, these turn out not to give much higher thresholds than an- 
cillae constructed using the standard prepare and discard approach. Thus, in line 
with predictions by Reichardt [ ], I find that improvements in the construction of 
small ancillae are largely irrelevant to the threshold. The increase in the threshold 
that is observed is thus primarily due to the error properties of the gates applied to 
the data. Consequently, the gain would be expected, and is observed, to apply to 
Steane's fault-tolerant method as well. I show in Chapter 4, however, that, effectively 
ignoring ancilla construction, the increase in the threshold due to a quite restricted 
error model is less than a factor of 2. On the other hand, given that I determine 
the threshold coefficient (the ratio of the threshold to the correctable error rate of 
the code) for the depolarizing error channel using Knill's procedure to be .35, it is 
unreasonable to hope for more than a factor of 3. This is because the threshold co- 
efficient corresponding to a single error on the data with probability p is 1; a higher 
coefficient would require that the probability of an error on the data be less than p, 
the probability of a gate error. Thus, for the cases of small ancilla preparation and of 
data gates, I have basically ruled out major gains in the threshold due to expanded 
knowledge of the form taken by gate errors. The remaining potential for improve- 
ments in the threshold due to knowledge of the error model therefore lies either in 
the construction of large ancillae or in error models for which more is known than 
the kind of Pauli errors produced. The work of Knill regarding heralded errors [30] 
is an example of the latter. 
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The utility of large ancillae is demonstrated in Chapter 4. There I bound the 
threshold given the availability of ancillae whose component qubits sport identical, 
uncorrelated error distributions. For ancillary qubit errors that occur with probabil- 
ity on the order of the gate error probability and in the limit that the size of the code 
goes to infinity, I find that these resources permit computation at error rates in excess 
of 1%. While these threshold bounds are only rigorous given the necessary ancillae, I 
observe fair agreement between my thresholds and recent estimates in the literature. 
In addition, I develop a finite version of the algorithm for threshold estimates using 
small codes that yields predictions for the Steane code in accordance with threshold 
estimates derived from the simulation used in Chapter 3. The success of my algorithm 
at threshold estimation depends on the feature of threshold estimates that dampened 
the results of Chapter 3, ancillae have a relatively minor role to play in threshold 
estimation. In addition, to the comparisons between error models discussed in the 
previous paragraph, I also compare the threshold for Knill's fault-tolerant procedure 
and two procedures based on Steane's method. Knill's telecorrection procedure is 
found always to have a higher threshold, a result that is likely to hold so long as 
ancillae with uncorrelated errors are available. 

In the absence of ancillae with identically distributed, uncorrelated errors, how- 
ever, my results from Chapter 4 do not establish rigorous bounds on the threshold, 
and the construction of sufficiently large ancillae is a non-trivial problem. In an 
effort to address this problem, I develop a novel method of ancilla construction in 
Chapter 5. My method employs a compressed form of the standard circuit for con- 
structing graph states, but the gates corresponding to edges are interspersed with 
'"X gates intended to extract X-error information during the process of construction. 
Through post-processing of the collected information, locations of both X errors and 
propagated Z errors are inferred. The process is not fault-tolerant, but I prove that 
each gate failure leads to at most 2 errors on the ancilla, and, numerically, I find that 
the typical error scale-up is small and that ancilla errors are limited almost exclu- 
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sively to Z errors. Given these facts, the prospects for further verification, or even 
direct use in special situations, seem promising. In fact, in Section 5.6 I suggest a 
possible avenue by which verification might be avoided altogether for CSS codestates. 
Ironically, the most difficult part of my ancilla construction procedure may prove to 
be the construction of the cat states necessary to perform it. Otherwise it compares 
favorably to other means of preparing ancillae. 

Even should direct ancilla construction prove impossible, however, the method 
developed in Chapter 4 provides a new tool for studying thresholds. I have found 
it useful for comparative studies because it provides a quick and simple means of 
predicting the outcome of Monte-Carlo threshold estimates on large codes. In addi- 
tion, by simplifying the complexity associated with estimating thresholds, I believe 
it helps to provide insight into the factors that shape and limit them. 
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Appendix A 

Asymptotic correctable error 
fraction for CSS codes 



In reference [2()] Gottesman and Preskill find that the asymptotic correctable error 
fraction for general CSS codes approaches 11%. Their resuh follows from two sepa- 
rate applications of Shannon's noisy channel coding theorem. Since they apply the 
random coding argument to the X stabilizers and the Z stabilizers separately, quan- 
tum mechanics plays a role only by restricting the number total number of stabilizers 
to be less than or equal to the number of qubits, n. As a consequence, the value 
they obtain for the asymptotic correctable error fraction is exactly the maximum 
error rate r for which a classical code with data rate ^ exists, that is, r such that 
.5 = H2{t) = -T log2 r - (1 - r) \og,{l - r), r ^ .11. 

I would like to apply the same argument in this appendix, but the CSS codes 
considered here have the additional property that the X stabilizers can be obtained 
from the Z stabilizers simply by replacing each Z with an X. Since the X and Z 
stabilizers must commute, this restriction corresponds to requiring that the binary 
matrix representing the X (or Z) stabilizer generators, known as the parity check 
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matrix, be dual-contained. To apply Shannon's noisy channel coding theorem^, I 
must show that any 2mm columns of the parity check matrix, where Wm is largest 
weight of any error being corrected, can be treated as though the entries were ran- 
domly and equiprobably assigned values of or 1. In this case, Wm ~ .lln; the 
remainder of this appendix is devoted to showing that .5n columns can be randomly 
assigned. 

Consider the following non-standard way to construct an m x n dual-contained 
parity check matrix, H, where m = f — ne and e <^ 1. Divide the matrix horizontally 
into two matrices of width | and denote them L and R. Now randomly assign the 
entries in L to be or 1 with equal probability. The probability is 2*~^/25 that the 
ith row of L is dependent given that the previous i — 1 rows are independent. The 
total probability that the rows of L are dependent, Pdl? is bounded by the sum of 
these terms. 



As n becomes large, for any fixed e, Pdl rapidly goes to zero and independent 
matrices come to dominate the output. 

Now I move to the problem of assigning R. I require that H be dual-contained, 
that is, that the rows be orthogonal to themselves and each other. Given that the 
rows of L consist of ^—ne independent, randomly chosen vectors, the restriction on H 
can be restated as the requirement that every row of R satisfy a different randomly 
chosen constraint with every other row in R and with itself (or equivalently, the 
vector of all Is). In addition to being random, the constraints are uncorrelated 
because the rows of L are independent. Constructing R one row at a time, the 
number of vectors that satisfy the constraints on the ith row is 2t~* assuming that 
the set of all previous rows and the all Is vector are independent. The probability 

^For a clear, detailed exposition of Shannon's noisy coding theorem for random linear 
codes see Section 14.2 of Reference [32]. 




m 



(A.l) 
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of a binary string satisfying a binary condition with a randomly chosen constraint is 
I since every binary string either satisfies a constraint or satisfies its negation. This 
means that the probabihty of any particular string satisfying i such constraints is 
2~\ Consequently, the probability of picking a dependent vector for the ith row of R 
given that none of the previous i — 1 vectors were dependent is 2'2~Y2t~* = 2*~t . As 
before, this yields a bound on the total probability of the rows of R being dependent, 

m m 

Pdr < 2-t ^ 2^ = 2 • 2-S J2 2'"' < 2""'+\ (A.2) 

i=l i=l 

which goes to zero as n goes to infinity. 

The probability of my matrix construction procedure halting due to the gener- 
ation of dependent rows goes to zero, but that does not necessarily imply that it 
generates all dual-contained parity check matrices. It is conceivable that the cases 
where a dependent vector is chosen, though rare, correspond to many more possible 
H matrices than the cases where an independent vector is chosen. To verify that 
this is not the case, it is sufficient to count the number of matrices generated by my 
procedure and to compare it with the total number of dual-contained mxn matrices. 

The number of possible L matrices generated by my procedure approaches A^l = 
2^, and the number of possible R matrices approaches 

m 

nn ■ nm ^ nm mm , . ^ 

22-* = 2^-^i=i* = 2^-^-^. (A.3) 

i=l 

By comparison, the total number of dual-contained parity check matrices of size 
mxn where all rows are linearly independent is 

m m 2 ^ 

Nu = JJ(2"-^ - 2') = 2""^-^^!^ JJ(1 - 2^^-") = 2""*-"^-? JJ(1 - 2^*-") 

i=l i=l i=l 

(A.4) 

which, of course, approaches 2'^"^~^~^ as n becomes large. 

Having found that Ah — N^N^ in the limit that n goes to infinity, I am now 
free to treat a random H as though as many as half of the columns are filled with 
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randomly generated binary digits. This means that the probabihty that a randomly 
chosen H satisfies H ■ {x + y) = is 2~™ for any two error vectors x and y such 
that X ^ y and the weight of x + y is less than |. Given that, Shannon's noisy 
coding theorem proceeds exactly as it did in reference [2l>]. There exist classical 
dual-contained codes that, with probability approaching 1 as n — oo, correct errors 
on up to 11% of the bits. Consequently, there exist CSS codes capable of correcting 
.lln X errors and a like number of Z errors with arbitrarily high probability. 
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Code 



The code used in this dissertation is available at http://info.phys.unm.edu. The 
function of each file is explained briefly in the remainder of this appendix. 

B.l Monte-Carlo Threshold Estimation Code 

The backbone of my Monte-Carlo threshold estimator is composed of the C files 
mtl9937ar-cok. c, 7QCode.h, 7QCode . cpp, Threshold. h, and Threshold, cpp. The 
file mtl9937ar-cok. c was coded by Takuji Nishimura and Makoto Matsumoto and 
implements a Mersenne Twister pseudorandom number generator. Basic gates and 
functions are defined in 7QCode . cpp for propagating errors using an array of length 
2 arrays of type char where each 2-element array represents the X and Z errors on 
a set of 8 qubits; the necessary declarations are given in 7QCode.h. Threshold. cpp 
contains the code for initializing and managing the simulation and taking data and 
statistics, while Threshold. h declares the functions for implementing encoded gates 
that are obtained from either StandardFTl . cpp or MyFTI . cpp. 

StandardFTI . cpp and MyFTI . cpp implement encoded gates for the fault-tolerant 
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method of Steane and for my own tailored method, respectively. Only one or the 
other can be included on compilation, otherwise the compiler will crash. 

B.2 Homogeneous Ancillae Threshold Code 

The Mathematica notebooks I use for calculating thresholds for homogeneous an- 
cillae are Infinite CSS code CX counter .nb and Finite CSS Code Bounder. nb. 
The first file implements error generation and propagation routines for encoded and 
unencoded gates for the single-coupling Steane, double-coupling Steane, and Knill 
procedures defined in Section 4.5. All possible Pauli errors requiring two or fewer 
failures are stored along with an algebraic representation of their associated proba- 
bilities (also up to second order). Pauli errors are stored as arrays of integers using 
the mapping {I , X,Y, Z} = {0,1,2,3}. The second file contains the code used to 
determine the range of possible threshold estimates for quantum codes with finite 
(small) numbers of qubits. 

B.3 Monte-Carlo Ancilla Construction Code 

GraphStateConstruction.py implements a Monte-Carlo routine for estimating the 
encoded failure probability of my method of graph-state construction using the lan- 
guage Python. That file contains all of the necessary error propagation functions as 
well as functions for performing tracking graph-state construction and interpreting 
the error trace. As in my Mathematica code, Pauli errors are stored as integers. 
The associated file GraphStatePreparationTraceViterbi .py contains code for in- 
terpreting error tracks using the Viterbi algorithm. 
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Appendix C 

The Viterbi Algorithm 



The Viterbi algorithm is a method of determining the most probable sequence of 
hidden states given limited observational data. The algorithm employs a kind of 
message passing routine to efficiently find the most probable sequence. This appendix 
explains the mechanics of the Viterbi algorithm and presents both a worked example 
and functional code. For a general treatment of message passing, the reader is 
referred to Information Theory, Inference, and Learning Algorithms [ .]. Other 
informative and entertaining introductions to the Viterbi algorithm can be found 
online. 



C.l Explanation 

Life is full of situations where it's important to figure out the most likely sequence 
of events based on limited observational data. Given a set of observations {oj} on a 
system occupying an unknown sequence of states, the most probable state sequence, 
or path, is that which maximizes the conditional probability P Sj \ /\ - Oi = Oi^ 
where 5*^ and Oi are random variables labeling elements of the sequence of states 
and the set of observations respectively. Using Bayes' rule this probability can be 
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written in terms of more accessible quantities as 

_p{^o. = o.\^^s,)p{^^s,) 



/\o. 



(C.l) 



P (/\ . Oi = Oi) does not vary during the maximization and can thus be discarded, 
thereby reducing the problem of finding the most likely path to that of maximizing 

p{^^o. = o,\^^s,)p[^^s,). 

While conceptually simple, this maximization is frequently computationally infea- 
sible because the number of possibilities that must be considered grows exponentially 
in the length of the sequence. In certain cases, however, there exist more efficient 
methods of solution than exhaustively searching all possibilities. 

One such case is that of a Markov process, that is, a process in which the state 
of the system at any time t depends on the previous states only in that it depends 
on the state of the system at time t — 1. In terms of conditional probabilities this is 
the statement that 

p(^St \ s,^ =P{St\St^,). (C.2) 
Using this fact it is possible to expand P Sj^ as 



pI/\sA=pIs, 



t-1 



n-1 



f\sAp{/\sA=p{s,\s,.,)p[/\s, 



(C.3) 



If each observation likewise depends only on the state of the system at a single time 
then P (^/\-0i = Oil Aj ^j^ can be expanded as 



pI/\0, = o, /\sA = l[p(0, = Oi\Si 

\ i j / i 



(C.4) 



where, of course, P{Oi = Oi\Si) = 1 if no observation occurs during time step i. 
Applying the identities in Eqs. C.3 and C.4 to P (^/\. Oi = Oi\ /\j S^^ P Sj^ , the 



Appendix C. The Viterbi Algorithm 



165 



probability that we wish to maximize over, yields 



(C.5) 

t~i \ t-i ^ 



= P{Ot = Ot\St)P{St\St-l)P llP{0^ = 0,\S,) 

which can, as we shall see, be maximized in an incremental fashion. 

Suppose {ui}\^i is the most probable sequence of states leading to St = Ut. From 
equation C.5 we know that 

P{}\^^ = j =PiSt = Ut\St^, = P S, = u}j , (C.6) 

implying that the sequence {wj}-!} must be the most probable sequence of states 
leading to St-i = Ut-i- Were it not, there would exist a different sequence 
such that Vt = Ut, Vt-i = Ut-i and 



n-i \ /t-i 



(C.8) 



Ui 



P\^/\S, = v,j >pi^/\S, = u,j. (C.7) 
But this would imply that 

P ||/\ 5, = v}j =PiSt = ut\St^i = ut-i) P (^/\ S, = v}j 

>P{St = ut\St^i = ut-i) P ^/\ = v}j=P (J\ S, = 

contradicting our assumption that the most probable sequence of states leading to 
St = Ut is 

The preceding paragraph shows that, for any Markov process, the most probable 
sequence of states concomitant with a particular set of observations can be calculated 
in a step-wise fashion. The procedure for doing so is known as the Viterbi algorithm 
and determines the most probable path to each state at each time step by starting 
from the most probable path to each state of the previous time step (and the asso- 
ciated probabilities) and calculating which of these paths leads most probably to a 
given state in the current time step. 
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C.2 Example 

Imagine that you are a professor lecturing a class of overworked and drowsy students. 
Long experience experience has taught you that students typically occupy one of two 
states, 'learning' or 'sleeping', and that students have a memory of about 15 minutes. 
For the purpose of assigning participation points, you keep track of which students 
are both in attendance and conscious. Unfortunately, one of your students has taken 
to wearing mirror shades. You can test whether he is awake by asking him a question, 
but asking the same student questions throughout the period would be disruptive. 
Instead, you decide to make a few observations and determine from those his most 
probable sequence of states. 

Over the course of 15 minutes, students who are learning have a 30% chance of 
going to sleep while students who are sleeping have a 20% chance of waking up. 
Additionally, 90% of students are awake (learning) when class starts. Thus, for a 45 
minute lecture, students are modeled succinctly by the graph 




Omin 15min 30min 45min 

where the rows labeled S and L represent the states 'sleeping' and 'learning' respec- 
tively and the columns correspond to the labeled times. 

At the 15 minute mark, you ask your blinkered student a question and receive no 
reply, a response that conscious students offer only 30% of the time. At the end of 

class, however, he promptly stands up and walks out, indicating that he was awake. 
What was the student doing during your lecture? 

On the basis of the information given you should conclude that he was most likely 
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P({S, S, S, S}, O) = P({S, S, S, L}, O) = 0.0128 

P({S, S, L, S}, O) = P({S, S, L, L}, O) = 0.0112 

P({S, L, S, S}, O) = P({S, L, S, L}, O) = 0.00036 

P({S, L, L, S}, O) = P({S, L, L, L}, O) = 0.00294 

P({L, S, S, S}, O) = P({L, S, S, L}, O) = 0.0432 

P({L, S, L, S}, O) = P({L, S, L, L}, O) = 0.0378 

P({L, L, S, S}, O) = P({L, L, S, L}, O) = 0.01134 

P({L, L, L, S}, O) = P({L, L, L, L}, O) = 0.09261 

Table C.l: Brute force approach to finding the student's most probable sequence of 
states. Each probabihty is found by multiplying the probability of the state sequence 
sequence in question by the probability of the observed data given that sequence. 
For example, P({L,L,L,L},0) = (.9) (.7)3 x .3 = .09261. 



Most-likely Path v.s. Time 





Omin 


15min 


30min 


45min 


S path 


S 


L,S 


L,S,S 


? 


probability 


0.1 


0.27 


0.216 





L path 


L 


L,L 


L,L,L 


L,L,L,L 


probability 


0.9 


0.189 


0.1323 


0.09261 



Table C.2: Finding the most probable path using the Viterbi algorithm. Given 
a set of observations, the Viterbi algorithm calculates the most probable path to 

each state at each time starting from the most probable paths to the states of the 
previous time; all probabilities account for the observations made thus far. At the 
15 minute mark, for example, the most probable path to 'learning' is {L,L} since 
the probability of being in state L initially and transitioning to state L is greater 
than the probability of being in state S initially and transitioning to state L; the 
corresponding probability is simply the product of the probability of being in state 
L, the probability of transitioning from there to state L, and the probability of L 
given the measurement, .9 x .7 x .3 = 0.189. 
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awake for the entire lecture. One method of reaching this conclusion is simply to 
enumerate every possible sequence of states and their corresponding probabilities of 
occurring in conjunction with your observations. This is done in table C.l. A more 
elegant, and generally more practical, approach is to apply the Viterbi algorithm 
as illustrated in table C.2. Rather than calculate every probability, the Viterbi 
algorithm calculates for each time step the probabilities associated with an extension 
of the most probable paths from the previous time step. As a consequence, the 
number of values that must be calculated by the Viterbi algorithm scales only linearly 
with the length of the sequence, while the brute force approach requires a number 
of calculations that is exponential in the length. 

Finally, it should be noted that graphs can be constructed that include the mea- 
surement outcome explicitly as a state. This is particularly useful when, as in chap- 
ter 5, the measurement can change the state of the system. If, in our example, 
students sometimes awoke due to being asked a question, then it would be necessary 
to include the result of the question in the graph. 

C.3 Code 

The Python code for implementing a single step of the Viterbi algorithm is given 
below. 

def viterbiStepdViterbi , multiplier) : 

Implements a single step of the Viterbi algorithm.""" 

# IViterbi contains the previous Viterbi probabilities and paths 

# multiplier contains the transition probabilities 
nViterbi = [] # new set of Viterbi probabilities and paths 

for i in xrange (len (multiplier [0] )) : # loop over destination states 
pathMax = None 
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probMax = 

for j in xrange (len (multiplier) ) : # loop over starting states 
(prob, path) = lViterbi[j] 
prob *= multiplier [j] [i] 
if prob > probMax: 

pathMax = path + [i] 
probMax = prob 
nViterbi . append ( (probMax , pathMax) ) 
return nViterbi 



170 



References 



[1] Scott Aaronson and Daniel Gottesman. Improved simulation of stabilizer cir- 
cuits. Phys. Rev. A, 70(5):052328, 2004. 

[2] Dorit Aharonov and Michael Ben-Or. Fault tolerant quantum computation with 
constant error. In Proc. ACM STOC, pages 176-188, 1997. 

[3] Dorit Aharonov and Michael Ben-Or. Fault-tolerant quantum computation with 
constant error rate, 1999. 

[4] Panos Aliferis and Andrew W. Cross. Subsystem fault tolerance with the bacon- 
shor code. Phys. Rev. Lett, 98(22) :220502, 2007. 

[5] Panos Aliferis, Daniel Gottesman, and John Preskill. Quantum accuracy thresh- 
old for concatenated distance-3 codes. Quantum Information and Computation, 
6:97-165, 2006. 

[6] Panos Ahferis and Barbara M. Terhal. Fault-tolerant quantum computation for 
local leakage faults. Quantum Information and Computation, 7:139-156, 2007. 

[7] Dave Bacon. Operator quantum error-correcting subsystems for self-correcting 
quantum memories. Phys. Rev. A, 73(1):012340, 2006. 

[8] Adriano Barenco, Charles H. Bennett, Richard Cleve, David P. DiVincenzo, 
Norman Margolus, Peter Shor, Tycho Sleator, John A. Smolin, and Harald We- 
infurter. Elementary gates for quantum computation. Phys. Rev. A, 52(5):3457- 
3467, Nov 1995. 

[9] Jonathan Barrett, Carlton M. Caves, Bryan Eastin, Matthew B. Elliott, and 
Stefano Pironio. Modeling pauli measurements on graph states with nearest- 
neighbor classical communication. Phys. Rev. A, 75(1):012103, 2007. 

[10] John S. Bell. On the Einstein-Podolsky- Rosen paradox. Physics, 1:195-200, 
1964. 



References 



171 



[11] P. O. Boykin, T. Mor, M. Pulver, V. Roychowdhury, and F. Vatan. On univer- 
sal and fault-tolerant quantum computing: a novel basis and a new constructive 
proof of universality for Shor's basis. In FOCS 1999: 40th Annual IEEE Sym- 
posium on Foundations of Computer Science, pages 486-494, New York, NY, 
October 1999. 

[12] Harry Buhrman, Richard Cleve, Monique Laurent, Noah Linden, Alexander 
Schrijvcr, and Falk Unger. New limits on fault-tolerant quantum computation. 
In FOCS 2006: 47th Annual IEEE Symposium on Foundations of Computer 
Science, pages 411-419, Berkeley, CA, October 2006. 

[13] A. R. Calderbank, E. M. Rains, P. W. Shor, and N. J. A. Sloane. Quantum 
error correction and orthogonal geometry. Phys. Rev. Lett., 78(3):405-408, Jan 
1997. 

[14] A. R. Calderbank and Peter W. Shor. Good quantum error-correcting codes 
exist. Phys. Rev. A, 54:1098, 1996. 

[15] Maarten Van den Nest, Jeroen Dehaene, and Bart De Moor. Graphical descrip- 
tion of the action of local Clifford transformations on graph states. Phys. Rev. 
A, 69(2):022316, 2004. 

[16] David P. DiVincenzo and Peter W. Shor. Fault-tolerant error correction with 
efficient quantum codes. Phys. Rev. Lett, 77(15) :3260-3263, Oct 1996. 

[17] David P. DiVincenzo, Peter W. Shor, and John A. Smolin. Quantum-channel 
capacity of very noisy channels. Phys. Rev. A, 57(2):830-839, Feb 1998. 

[18] Bryan Eastin. Fault-tolerant thresholds for encoded ancillae with homogeneous 
errors. Phys. Rev. A, 75(2):022301, 2007. 

[19] Matthew B. Elliott, Bryan Eastin, and Carlton M. Caves. Graphical description 
of the action of Clifford operators on stabilizer states. 2007. 

[20] Robert G. Gallager. Low-density parity-check codes, 1963. Expanded version 
of 1960 M.I.T. Dissertation. 

[21] S. Glancy, E. Knill, and H. M. Vasconcelos. Entanglement purification of any 
stabihzer state. Phys. Rev. A, 74(3):032319, 2006. 

[22] Daniel Gottesman. Class of quantum error-correcting codes saturating the quan- 
tum hamming bound. Phys. Rev. A, 54(3): 1862-1868, Sep 1996. 

[23] Daniel Gottesman. Stabilizer Codes and Quantum Error Correction. PhD thesis, 
California Institute of Technology, 1997. 



References 



172 



[24] Daniel Gottesman. Theory of fault-tolerant quantum computation. Phys. Rev. 
A, 57(1):127-137, Jan 1998. 

[25] Daniel Gottesman and Isaac L. Chuang. Demonstrating the viability of universal 
quantum computation using teleportation and single-qubit operations. Nature, 
402:390-393, 1999. 

[26] Daniel Gottesman and John Preskill. Secure quantum key distribution using 
squeezed states. Phys. Rev. A, 63(2):022309, Jan 2001. 

[27] Mitsuru Hamada. Reliabihty of Calderbank-Shor-Steane codes and security of 
quantum key distribution. J. Phys. A, 37(34) :8303-8328, 2004. 

[28] Richard W. Hamming. Error detecting and error correcting codes. Bell Syst. 
Tech. J., 29:147-160, 1950. 

[29] Emanuel Knill. Quantum computing with realistically noisy devices. Nature, 
434:39-44, 2005. 

[30] Emanuel Knill. Scalable quantum computing in the presence of large detected- 
error rates. Phys. Rev. A, 71(4):042322, 2005. 

[31] Emanuel Knill, Raymond Laflamme, and Wojcicch Zurek. Resilient quantum 
computation: Error models and thresholds. Proc. R. Soc. London, Ser. A, 
454:365-384, 1998. 

[32] David J. C. MacKay. Information Theory, Inference and Learning Algorithms. 
Cambridge University Press, New York, NY, USA, 2003. 

[33] David J.C. MacKay and Radford M. Neal. Good codes based on very sparse 
matrices. In IMA: IMA Conference on Cryptography and Coding. 

[34] F.J. Mac Williams and N.J. A. Sloane. The Theory of Error- Correcting Codes. 
North-Holland, Amsterdam, The Netherlands, 1977. 

[35] N. David Mermin. What's wrong with these elements of reality? Physics Today, 
pages 9-11, June 1990. 

[36] Garlos Mochon. Anyon computers with smaller groups. Phys. Rev. A, 
69(3):032306, 2004. 

[37] Michael A. Nielsen and Isaac L. Chuang. Quantum computation and quantum 
information. Cambridge University Press, Cambridge, England, 2000. 



References 



173 



[38] Jian-Wei Pan, Dik Bouwmeester, Matthew Daniell, Harald Weinfurter, and 
Anton Zeilinger. Experimental test of quantum nonlocality in three-photon 
Greenberger-Horne-Zeihnger entanglement. Nature, 403:515-519, February 
2000. 

[39] John Preskill. Lecture notes for physics 229: Quantum information and com- 
putation. 

[40] John Preskill. Fault-tolerant Quantum Computation. World Scientific, 1998. 

[41] Robert Raussendorf and Hans J.Briegel. A one-way quantum computer. Phys. 
Rev. Lett, 86(22):5188-5191, May 2001. 

[42] Alexander A. Razborov. An upper bound on the threshold quantum decoherence 
rate. Quantum Information and Computation, 4(3):222-228, 2003. 

[43] Michael Reck, Anton Zeilinger, Herbert J. Bernstein, and Philip Bertani. Exper- 
imental realization of any discrete unitary operator. Phys. Rev. Lett., 73(1) :58- 
61, Jul 1994. 

[44] Ben W. Reichardt. Improved ancilla preparation scheme increases fault-tolerant 
threshold, 2004. 

[45] Ben W. Reichardt. Fault-tolerance threshold for a distance-three quantum code. 
In Automata, Languages and Programming, volume 4051, pages 50-61. Springer 
Berhn, Heidelberg, 2006. 

[46] M. A. Rowe, D. Kielpinski, V. Meyer, C. A. Sackett, W. M. Itano, C. Monroe, 
and D. J. Wineland. Experimental violation of a bell's inequahty with efficient 
detection. Nature, 409:791-794, February 2001. 

[47] Claude E. Shannon. A mathematical theory of communication. Bell Syst. Tech. 
J., 27:379, 1948. 

[48] Peter W. Shor. Scheme for reducing decoherence in quantum memory. Phys. 
Rev. A 52, 52:2493-2496, 1995. 

[49] P.W. Shor. Fault-tolerant quantum computation. Foundations of Computer 
Science, page 56, 1996. 

[50] Andrew M. Steane. Error correcting codes in quantum theory. Phys. Rev. Lett., 
77(5):793-797, Jul 1996. 

[51] Andrew M. Steane. Multiple particle interference and quantum error correction. 
Proc. R. Soc. A, 452:2551, 1996. 



References 



174 



[52] Andrew M. Steane. Space, time, parallelism and noise requirements for reliable 
quantum computing. Fortsch. Phys., 46:443, 1998. 

[53] Andrew M. Steane. Overhead and noise threshold of fault-tolerant quantum 
error correction. Phys. Rev. A, 68(4):042322, 2003. 

[54] Krysta M. Svore, A. W. Cross, Isaac L. Chuang, and A. V. Aho. A flow- 
map model for analyzing pseudothresholds in fault-tolerant quantum computing. 
Quantum Information and Computation, 6(3):193-212, 2006. 

[55] Krysta M. Svore, Barbara M. Terhal, and David P. DiVincenzo. Local fault- 
tolerant quantum computation. Phys. Rev. A, 72(2):022317, 2005. 

[56] Traccy E. Tessier, Carlton M. Caves, Ivan H. Deutsch, Bryan Eastin, and 
Dave Bacon. Optimal classical-communication-assistcd local model of n-qubit 
Greenberger-Horne-Zeilinger correlations. Phys. Rev. A, 72(3):032305, 2005. 

[57] Christof Zalka. Threshold estimate for fault tolerant quantum computation, 
1997. Unpublished, quant-ph/9612028. 



